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Systemin, an 18 amino acid peptide hormone and first polypeptide hormone found in plants, induces expression of de- 
fense genes in plants wounded mechanically or by predators including herbivores, insects, bacteria and viruses. The precursor for 
systemin is encoded as a 200 amino acid prosystemin molecule that has the systemin peptide sequence located near the carboxyl- 
terminus. Both a 951 bp cDNA for prosystemin and 4526 bp genomic DNA were cloned and the organization of the gene was 
determined. Transgenic plants constructed with antisense prosystemin DNA fail to mount a defensive response to wounding. 
Transgenic plants constructed with increased copy number of prosystemin genes exhibit increased resistance to wounding. Insect 
larval that feed on transgenic plants constructed with increased copy number of prosystemin genes exhibit decreased growth 
weight compared to larval that feed on wild type plants. A tomato systemin polypeptide has an amino acid sequence 
NH3-AVQSKPPSKRDPPKMQTD-COO-. 



BNSEX3CID: <WO 93 1 9079A 1 _L> 



REST AVAIUBLE COP^^ 



FOR THE PURPOSES OF INFORMATION ONLY 

Codes used lo identify Slates party to the PCI* on the front pages of pamphlets publishing international 
applications under the PCX. 



AT 


Austria 


FR 


France 


MR 


Mauritania 


AU 


Australia 


CA 


Gabon 


MW 


Malawi 


BB 


Barbados 


GB 


United Kingdom 


NL 


Netherland:i 


BE 


Belgium 


CN 


Guinea 


NO 


Norway 


BP 


Burkina. Fo^u 


CR 


Cruecc 


NZ 


New Zealand 


BG 


Bulgaria 


HU 


Hungary 


PL 


Poland 


BJ 
BR 


Benin 
Bra/il 


IE 
IT 


Ireland 
iiaty 


PT 
RO 


Portugal 
Romania 


CA 


Canada 


JP 


Jap:tn 


RU 


Ru:isian Federation 


CF 


CTuniral Arricait Republic 


KP 


Democratic People*^ Repuhlie 


SD 


Sudan 


CC 


C *ongu 




or Korea 


SB 


Sweden 


CH 


Swiucrland 


KR 


Republic of Korea 


SK 


Slovak Republic 


CI 


C'6(c dHvotre 


K2 


Kazakh:>Un 


SN 


Senegal 


CM 


CVnierutiii 


U 


Liectiienstein 


su 


Soviet Union 


OS 


CVx:t:htr»h>vai>ta 


LK 


Sri 1 «inLa 


TD 


Chad 


C2 


CVccIi Republic 


I.U 


I .uxumbourg 


TC 


'l«»go 


DE 


(iermany 


MC 


Monaco 


UA 


Ukraine 


DK 


Denmark 


MC 


Madagascar 


US 


United State:* or America 


BS 


Spain 


MI. 


Mali 


VN 


Viet Nam 


FI 


f*inland 


MN 


Mongolia 







BNSDOCID: <WO 9319079A1_L> 



wo 93/19079 



PCT/US93/02428 



- 1 - 

SYSTEMIN 
GOVERNMENT SPONSORSHIP 

This Invention was made with government support under grant number 
DCB91 04542 awarded by the National Science Foundation. The government has 
5 certain rights in the invention. 

RELATED APPUCATIONS 

This application Is a continuation-in-part of U.S. Patent Application Serial No. 
07/855,412, filed March 19, 1992, which is a continuation-in-part of U.S. Patent 
Application Serial No. 07/528,956, filed May 25, 1990. and a continuation-in-part of 
1 0 US/91 /03685, filed May 24, 1 991 . 

FIELD OF THE INVENTION 
This invention relates to methods and materials for Inducing plant defense 
mechanisms. More particularly, this invention relates to methods for Inducing the 
production of plant defense proteins, such as proteinase Inhibitors, and to methods 
15 of regulating resistance to predators, hertjivores, insect, pathogen or virus in plants 
by inducing or suppressing the expression of genes encoding systemin or 
prosystemin. 

BACKGROUND OF THE INVENTION 

Damage to crops by predators (i.e., Insects, herbivores, and pathogens, 
20 including fungi, bacteria, and viruses), results in substantial annual losses In 
agricultural production. Man has created and employed a wide range of chemicals 
In attempting to reduce damage to plant crops, Many environmental problems have 
been created by the widespread use of chemicals that may only provide a transient 
level of protection for crops. Chemicals also suffer from the disadvantage that all 
25 organisms in an area may be indiscriminately treated, causing needless damage to 
many beneficial organisms. Many chemicals are also potentially toxic to man and 
animals. 

Attempts to reduce crop damage have Included selective breeding for 
resistance, but resistance traits can frequently be controlled by many genes making 
30 It difficult (or Impossible) to genetically select a desired attribute. Decreased crop 
yields are also commonly encountered In resistance strains. Accordingly, there exists 
a strong need for compositions and processes to improve the resistance of plants 
under attack by herbivores. 

Plants have evolved inducible defensive mechanisms that respond to attacks 
35 by predators (C. A Ryan, 1 990, Ann. Rev. Phytopathol. 28:425; D.J. Bowles. 1 990, Ann. 
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Rev. Biochem, 59:873; M. Chessin and A.E. Zipf, 1990, The Botanical Review 56:193; 
D.L Dreyer and B.C. Campbell, 1987. Plant, Cell and Environ. 10:353). One 
mechanism involves systemic synthesis of serine proteinase inhibitors that are 
accumulated at distal tissue sites in plants. The proteinases can inhibit the digestive 
5 enzymes of insects and microorganisms (T.R. Green and C.A. Ryan, 1972, Science 
175:776; C.A. Ryan. 1978, TIBS 3(7):148; V.A. Milder, A.M.R. Gatehouse. S.E. 
Sheerman, R.F. Barker, D. Boulter, 1987 A/aft/re 330:160; R. Johnson. J. Narvaez. 
G.An, CA. Ryan, 1989, Proc. Natl. Acad, ScL U.S.A 86:9871). Proteinase inhibitors 
can be detrimental to the growth and development of insects from a variety of genera 
10 including Heiiothis, Spodoptera, Diabiotica and Tribolium (Ryan, supra; Broadway. 
supra; Rechsteiner, supra). Several families of polypeptides have been described that 
inhibit serine proteinases, including: the Kunitz family (e.g.. Soybean trypsin inhibitor); 
the Bowman-Birk family; (e.g., Soybean proteinase inhibitor); the Potato I and Potato 
n families; the Barley trypsin inhibitor family; and, the Squash inhibitor family. 
15 Wounding of plants by animals, including insects, and pathogens or 

mechanical damage reportedly induces transcriptional activation of proteinase inhibitor 
genes and protein synthesis (J.S. Graham, G. Hall, G. Pearce, C.A. Ryan, 1986, Planta 
169:399). The latter wound-response has reportedly been described in a variety of 
species including; tomato (J.S. Graham. G. Pearce, J. Merryweather, K. Titani, L 
20 Ericsson, C.A. Ryan, 1985, J. Biol. Ctiem. 260(1 1):6555; J.S. Graham, G. Pearce, J. 
Merryweather, K. Titani, LH. Ericsson, C.A. Ryan, 1985, J. BioL Chem. 260(1 1):6561), 
potato (C.A. Ryan, 1968. Plant PIrysioL 43:1880), alfalfa (W.E. Brown and C.A. Ryan. 
1984, Biochemistry 23:3418; W.E. Brown, K. Takio. K. Titani, C.A. Ryan, 1985, 
B/oc/7em/st/y 24:2105), cucurbits (D. Roby, A. Toppan, M.T Esquerre-Tugaye. 1987, 
25 Physiol. MoL PL Pathol. 30:6453) and poplar trees (H.D. Bradshaw, J.B. Hoflick, TJ. 
Parsons, H.R.G. Clarke, 1989, Plant MoL BloL 14:51). Wounding reportedly results in 
the rapid accumulation of proteinase inhibitors not only in wounded leaves but also 
in distal, unwounded leaves, suggesting that a signal, or signals, released from the 
wound site travels throughout the plant. Transport of these signals is mediated locally 
30 through intercellular and intracellular fluids that permeate wound or infection sites 
(Green, TR. and C.A. Ryan, Science 175:776-777. 1972) or travel systemlcally through 
the vascular system of plants (Kuc, J. and C. Presisig. Mycologia 76:767-784, 1984: 
M. Kopp, et al., Plant PhysioL 90:208-216, 1990; and K.E. Hammond-Kosack, et al., 
PhysioL Mol Plant Path. 35:495-506, 1989), Proposed wound signals include: pectic 
35 fragments derived from the plant cell wall (CA. Ryan and E.E. Fanner, 1 991 . Annu, 
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Rev. Plant Physiol. MoL Bio, 42:651); the lipid<lerived molecule, jasmonic acid (E.E. 
Farmer and CJK, Ryan, 1990, Proc. Natl. Acad. Sc/. U,S.A. 87:7713); the plant 
hormone, abscisic acid (H. Pena-Cortes, JJ. Sanchez-Serrano. R. Mertens, L 
Willmitzer, S. Prat, 1989, Proc. Natl. Acad. ScL U.S.A. 86:9851); electrical potentials (E. 
5 Davies, 1987, Plant, Cell and Environ. 10:623; J.F. Thain, H.M. Doherty, DJ. Bowles, 
D.C. Wildon, 1990, Plant, Cell and Environ. 13:569); and, more recently, an 18-amino 
acid polypeptide called systemin (G. Pearce, D. Strydom, S. Johnson, C,A. Ryan. 
1991 , Science 253:895). 

SUMMARY OF THE INVENTION 

1 0 Disclosed herein are a) the isolation and sequencing of Systemin, an 1 8-amino 

acid polypeptide and first polypeptide hormone found in plants; b) the isolation and 
sequencing of prosystemin, a precursor 200 amino acid 23kDa polypeptide; c) the 
molecular cloning of cDNA encoding prosystemin and genomic DNA encoding 
prosystemin mRNA; d) the construction of antisense vectors encoding antisense RNA 

15 inhibiting prosystemin synthesis; e) the construction of vectors containing the 
proseptemin or systemin sense nucleic acid, as well as, f) a method of enhancing the 
defense mechanism of plants. Systemin has been shown to be a powerful Inducer of 
the synthesis of wound-inducible plant defense proteins including members of 
proteinase inhibitor families, i.e., the Inhibitor I (8100 Da) and Inhibitor n (12,300 Da) 

20 families. Radioactively labelled systemin applied to a plant wound site is rapidly 
translocated to distal tissues where it induces synthesis of defense proteins. Systemin 
is represented only once In the precursor prosystemin molecule and is located dose 
to the carboxy terminus of the precursor protein. Plants expressing antisense 
prosystemin RNA exhibit a greatly reduced synthesis of wound-induced proteinase 

25 inhibitors. Transgenic plants expressing sense prosystemin RNA exhibit an Increased 
level of wound-induced proteinase inhibitors. Manduca sexta larvae exhibit lowered 
growth weight when feeding on transgenic sense tomato plants compared to those 
larvae feeding on wild type plants. 

Nucleic acid sequences of the invention are capable of encoding a systemin 

30 or prosystemin polypeptide or antisense RNA. The nucleic acids comprise a 
nucleotide sequence capable of hybridizing with the sense or antisense strand of the 
nucleotide sequence of the prosystemin cDNA or genomic DNA. The nucleic acids 
of the Invention encode prosystemin and systemin polypeptides or are antisense 
sequences which interfere with the expression of systemin or prosystemin in vivo. 

35 Systemin related polypeptides of the present invention comprise the amino acid 
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sequence R^RiQR^RgPPRiRjRgR^PPR^RiQR^Ri, wherein R^ is any amino acid, R2 is 
lysine or arginine (or any derivative thereof), Q is glutamine (or any derivative thereof), 
and P is proline (or any derivative thereof). A representative example of a systemin 
polypeptide of the invention is the amino acid sequence: NH3- 
5 AVQSKPPSKRDPPKMQTD-COO\ 

The processes of the present invention are useful for enhancing synthesis of 
defense proteins in a plant by introducing a prosystemin sense nucleic acid into a 
plant cell, or inhibiting synthesis by introducing an antisense nucleic acid. Transgenic 
plants containing the subject nucleic acids of the invention are also provided. 

10 BRIEF DESCRIPTION OF THE DRAWINGS 

The foregoing aspects and many of the attendant advantages of this invention 
will become more readily appreciated as the same becomes better understood by 
reference to the following detailed description, when taken In conjunction with the 
accompanying drawings, wherein: 

15 FIGURE 1 shows preliminary purification of systemin from the extracts of 

tomato leaves by semipreparative reverse-phase HPLC as described in Example 1 , 
below. 

FIGURE 2 shows substantial purification of systemin by chromatography on an 
SCX-HPLC column as described in Example 1 , below. 
20 FIGURE 3 shows the amino acid sequence of the systemin polypeptide. 

FIGURE 4 shows Induction of defense protein synthesis, i.e., Inhibitor I (closed 
circles) and Inhibitor n (open circles), in tomato plants by a synthetic systemin 
polypeptide, as described in Example 2, below. 

FIGURE 5A shows the autoradiograph of a tomato leaf that was treated with 
25 ^^C-labeled synthetic systemin polypeptide to demonstrate transport of systemin from 
wound sites into distal plant tissues. 

FIGURE 5B shows the ^*C-labeled synthetic systemin isolated by reverse-phase 
HPLC from the distal plant tissues of FIGURE 5A. 

FIGURE 6 shows the amino acid sequence of prosystemin. 
30 FIGURE 7 shows the nucleotide sequence of cDNA encoding prosystemin with 

crosshatched underlining showing repeated sequence motffs and vertical bar 
underlining showing the location of systemin in the precursor sequence. 

FIGURE 8 shows the nucleotide sequence of the prosystemin gene. 
FIGURE 8A shows the nucleotide sequence of the prosystemin gene from 
35 position 1 to position 2100. 
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FIGURE SB shows the nucleotide sequence of the prosystemln gene from 
position 21 01 to position 4200. 

FIGURE 8C shows the nucleotide sequence of the prosystemin gene from 
position 4201 to position 4526. 
5 FIGURE 9A shows the organization of the prosystemin gene. The gene 

consists of a 104 bp 5'-untranslated region, a 4176 bp coding region composed of 1 1 
exons (vertical bars) Interrupted by 10 introns, and a 246 bp 3'-untranslated region. 
The position of systemin is indicated by a horizontal bar labelled SYS. 

FIGURE 9B shows a Southern blot analysis of the prosystemin gene. Tomato 
10 genomic DNA was isolated from leaves (as described in Example 6, below), and 5 jig 
was digested with EcoRI (lane 1); Bgl U (lane 2) or Sea I (lane 3), and 
electrophoresed on a 0.8% agarose gel that was probed with nick-translated 
prosystemin. 

FIGURE 9C shows a Southern blot analysis of the species distribution of 
15 prosystemin gene homologues, as described in Example 9, below. Genomic DNA (5 
|ig) from tomato (lane 1), potato (lane 2), tobacco (lane 3), alfalfa (lane 4). and 
Arabidopsis (lane 5) was digested with EcoRI and electrophoresed on a 0.8% agarose 
gel. The gel was blotted onto nitrocellulose and probed with nick-translated 
prosystemin cDNA. 

20 FIGURE 10 shows the organization of the prosystemin gene. Exons are 

represented by vertical bars and numbered 1 to 11, The five exon pairs are: 1 plus 
2; 3 plus 4; 5 plus 6; 7 plus 8 and 9 plus 10. 

FIGURE 1 1 shows sequence alignment of the prosystemin gene exons. The 
consensus sequence (con) is composed of those bases that occur at the same 
25 position In at least three of the five exon sequences. 

FIGURES 1 1 A shows the alignment of sequences of the first exons of each pair 
(exons 1, 3, 5. 7, and 9). 

FIGURES 1 1 B shows the alignment of the sequences of the second exons of 
each pair (exons 2, 4, 6, 8, and 10). 

30 FIGURE 12 shows sequence alignment of three repeated polypeptide 

sequences within prosystemin. Three polypeptide sequences (Rep A, Rep B and Rep 
C; each occumng once within the amino-terminal half of prosystemin), are aligned 
with the homologous sequences (Rep 2A, Rep 2B and Rep 2C; each occurring once 
within the carboxy terminal half of prosystemin). Amino acids which differ between 

35 repeats are underiined. The amino acids at the beginning and end of eadi repeat are 
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numbered from the amino terminus of prosystemin. ^ 

FIGURE 13A shows the positions of the duplicated polypeptide sequences 
within prosystemin. Prosystemin is represented by a horizontal bar with the amino 
acid residues numbered 1 to 200 from the amino terminus. Sequence elements Rep 
5 A, Rep B and Rep C and their repeats Rep 2A, Rep 2B and Rep 2C are indicated by 
hatched bars. Systemin is represented by a hatched bar labelled Sys. 

FIGURE 138 shows the location of the sequences encoding the polypeptide 
repeats within the prosystemin gene. Exons are represented by vertical bars. The 
parts of the exons encoding the polypeptide repeats are shaded. 

10 FIGURE 14 shows a sequence comparison of the intron boundary at the 3 - 

end of the exons 3 and 7. Exon sequence is underlined. The first four bases of the 
intron at the 3'-end of exon 7 have been displaced to facilitate accurate alignment of 
the homologous sequences occurring at the 3*-end of exon 3 and at the 5'-end of the 
intron between exons 7 and 8. 

1 5 FIGURE 1 5A shows a Northern blot analysis of the time course of induction of 

prosystemin mRNA and Inhibitor I mRNA after wounding, as described in Example 7, 
below. 

FIGURE 158 shows a Northern blot analysis of the distribution of prosystemin 
mRNA in various parts of an unwounded, fully-grown tomato plant, as described in 
20 Example 8. below. Total RNA was extracted from the following parts of an unwounded 
tomato plant; root (R); stem (St); petiole (R); leaf (Ue); sepal (Se); petal (Pe); stamen 
(Sm) and pistil (Pi). 

FIGURE 16A shows a Northern blot analysis of total RNA extracted from 
transgenic antisense plant 1A4. Lane 1 shows the results obtained with the sense 
25 probe and Lane 2 shows the results with the antisense probe, as described in 
Example 10, below. 

FIGURE 168 shows a graphic depiction of the levels of Inhibitor I In wounded 
F1 transgenic antisense plants (unshaded bars) and non-transformed control plants 
(solid bars), as described in Example 10, below. 
3D FIGURE 160 shows a graphical depiction of the levels of Inhibitor II in 

wounded F1 transgenic antisense plants (unshaded bars) and non-transformed 
control plants (solid bars), as described in Example 10, below. 

FIGURE 17 shows a Northern blot analysis of total RNA extracts collected at 
different times from undamaged leaves of control (nontransformed) and transgenic 
35 tomato plants during the feeding experiments with Manduca sexta larvae, as described 
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in Example 1 1 , below. 

FIGURE 1 8 shows the time course of accumulation of Inhibitor I and II proteins 
in undamaged leaves of control and transgenic tomato plants induced by the feeding 
of Manduca sexta larvae, as described in Example 1 1 , below. 
5 FIGURE 1 9 shows the growth of Manduca sexta larvae, while feeding on leaves 

of control and transgenic antisense tomato plants, as described in Example 1 1 , below. 

FIGURE 20 shows the time course of larval weight gain of Manduca sexta 
larvae feeding on leaves of wild type and transgenic tomato plants, as described in 
Example 1 1 , below. 

10 DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT 

As used herein the following terms are used to mean: 
The term "defense proteins" is intended to include proteins that impede plant 
tissue attack or ingestion by predators, such as by herbivores, Insects, fungi, bacteria 
or viruses. Defense proteins increase resistance of plants to predator attack by acting 

15 directly to impede plant tissue attack or ingestion, or by acting Indirectly to produce 
other defense compounds from precursor materials, {e.g., by acting to induce 
enzymes in a pathway synthesizing defense compounds; or, by inducing proteins that 
regulate enzymes that synthesize defense compounds). Representative examples of 
defense proteins Include: e.g.. proteinase inhibitors, thionlns, chitinases and p- 

20 glucanases. Representative enzymes that lead to the synthesis of defense 
compounds include, e.g., casbene synthase. Representative enzymes that are part 
of a biosynthetic pathway leading to synthesis of defense compounds includes, e.g. 
enzymes in the phenylpropenoid and terpenoid pathways for synthesis of phytoalexin 
antibiotics, alkaloids and other toxic chemicals. Other predator defense proteins 

25 useful in connection with the invention disclosed herein will, of course, be apparent 
to those skilled in the art. Particularly suitable predator defense proteins Include 
inhibitors of digestive proteolytic enzymes of the attacking herbivore, such as 
proteinase inhibitors, and antibacterial, antimycotic, and antiviral compounds and the 
like. Representative proteinase inhibitor defense proteins include, e.g., the Kunttz 

30 family of trypsin inhibitors, the Bowman-Birk family of proteinase inhibttors, the Inhibitor 
I family of proteinase inhibitors, the inhibitor n family of proteinase Inhibitors, the 
barley family of trypsin inhibitors, and the squash family of proteinase inhibitors. 
Representative examples of plant proteinase inhibitors are disclosed in 
PCT/US/91/03685, a continuation-in-part application of U.S. Patent Application Serial 

35 No. 07/528,956, the disclosures of both applications are incorporated herein by 
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reference. 

The term "nucleic acid" is intended to mean natural and synthetic linear and 
sequential arrays of nucleotides and nucleosides, e.g., in cDNA, genomic DNA 
(gDNA). mRNA. and RNA, oligonucleotides, oligonucleosides, and derivatives thereof. 
5 For ease of discussion, such nucleic acids are at times collectively referred to herein 
as "constructs," "plasmids," or 'Vectors." Representative examples of the nucleic acids 
of the invention include bacterial plasmid vectors such as expression, cosmid, and 
cloning and transformation vectors (e.g., pBR322, A,, Ti, and the like), plant viral 
vectors (e.g., modified CaMV and the like), and synthetic oligonucleotide molecules 
1 0 such as chemically synthesized RNA or DNA. 

The term "encoding" is intended to mean that the subject nucleic acid may be 
transcribed and translated into the subject protein in a cell, e.g., when the subject 
nucleic acid is linked to appropriate control sequences such as promoter and 
enhancer elements in a suitable vector (e.g., an expression vector) and when the 
1 5 vector is introduced into a cell. 

The term "polypeptide" is used to mean three or more amino acids linked In 
a serial array. 

The term "antisense DNA" is used to mean a gene sequence DNA that has a 
nucleotide sequence homologous with the "sense strand" of a gene when read in a 

20 reversed orientation, /.e.. DNA read into RNA in a 3' to 5' rather than 5* to 3* direction. 
The term "antisense RNA" is used to mean a RNA nucleotide sequence (e.g., encoded 
by an antisense DNA or synthesized complementary with said antisense DNA). 
Antisense RNA is capable of hybridizing under stringent conditions with an antisense 
DNA. The antisense RNA of the invention is useful for inhibiting expression of a 

25 "target gene" either at the transcriptional or translational level. For example, 
transcription of the subject nucleic acids may produce antisense transcripts that are 
capable of inhibiting transcription by inhibiting initiation of transcription or by 
competing for limiting transcription factors; or, the antisense transcripts may inhibit 
transport of the 'target RNA"; or. the antisense transcripts may inhibit translation of 

30 "target RNA". 

The term "sense strand" is used to mean the single stranded DNA molecule 
from a genomic DNA that is transcribable and translatable into the polypeptide 
product of the gene. The term "antisense strand" is used to mean the single strand 
DNA molecule of a genomic DNA that is complementary with the sense strand of the 
35 gene. 
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The term "capable of hybridizing under stringent conditions" is used to mean 
annealing a first nucleic acid to a second nucleic acid under stringent conditions 
(defined below). For example, the first nucleic acid may be a test sample, and the 
second nucleic acid may be the sense or antisense strand of a prosystemin gene. 
5 Hybridization of the first and second nucleic acids is conducted under stringent 
conditions, e.g., high temperature and/or low salt content, which tend to disfavor 
hybridization of dissimilar nucleotide sequences. A suitable protocol involving 
hybridization in 6 X SSC, at 42*C in aqueous solution followed by washing with 1 X 
SSC. at 55X in aqueous solution is provided in the illustrative examples below. 

1 0 (Other experimental conditions for controlling stringency are described in Maniatis, T., 
et aL, Molecular Cloning; A Laboratory Manual, Cold Springs Harbor Laboratory, Cold 
Springs, NY, 1982, at pages 387-389; and also in Sambrook. Fritsch, and Maniatis, 
Molecular Cloning; A Laboratory Manual, Second Edition, Volume 2, Cold Springs 
Harbor Laboratory, Cold Springs, NY, 1 989, pages 8.46-8.47.) 

15 The term "fragment" when used herein with reference to nucleic acid (e.g., 

cDNA, genomic DNA, /.e., gDNA) is used to mean a portion of the subject nucleic acid 
such as constructed artificially (e.g., through chemical synthesis) or by cleaving a 
natural product into a multiplicity of pieces (e.g., with a nuclease or endonudease to 
obtain restriction fragments). 

20 The term "synthetic oligonucleotide" refers to an artificial nucleic acid (e.g., a 

chemically synthesized nucleic acid) having 9 or more nucleotides. 

The term "systemin polypeptide" is used to mean a polypeptide having an 
amino acid sequence RiR^QRiRaPPR^RgRaRiPPRgRiQRiRi, wherein R, is any amino 
acid, Rg is lysine or arginine (or derivative thereof), Q is glutamine (or derivative 

25 thereof), and P is proline (or any other derivative thereof), e.g., the systemin 
polypeptide of FIGURE 3; namely, NH3-AVQSKPPSKRDPPKMQTD-COO-. Skilled 
artisans will recognize that through the process of mutation and/or evolution, 
polypeptides of different lengths, e.g. . with insertions, substitutions, deletions, and the 
like, may have arisen that are related to the systemin polypeptide of the invention by 

30 virtue of: a) amino acid and/or nucleotide sequence homology; b) a defensive 
function in regulating gene expression in response to predators, pathogens, and 
mechanical injury; and/or, c) the organization of the genomic DNA, as described in 
Example 6, below. Representative examples of systemin family members in tomato 
and potato are provided in Example 6-9 (below), and illustrative methods for 

35 identification of systemin family members in other species, genra, and families of 
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plants are also provided in Examples 6-9 (below). 

The term "systemin nucleic acid" is used herein to refer to that subset of 
nucleic acids capable of encoding a systemin polypeptide. 

The term "prosystemin polypeptide" is used to mean a precursor polypeptide 
5 capable of giving rise to a systemin polypeptide. A representative example is provided 
by the prosystemin polypeptide encoded by the cDNA of FIGURE 7 or the coding 
region of the genomic DMA of FIGURES 8A. 8B, and 8C. Prosystemin polypeptide is 
capable of being cleaved {e.g., chemically or enzymatically) to give rise to systemin. 
A representative method for identifying prosystemin genes in different species of 

10 plants is provided in Example 9, below. 

The term "prosystemin nucleic acid" is used herein to refer to that subset of 
nucleic acids capable of encoding a prosystemin polypeptide. 

Embodiments of the invention described and illustrated below provide systemin 
and prosystemin polypeptides, nucleic acids encoding systemin and prosystemin 

16 mRNA. cDNA. and genomic DNA, and. including 5' regulatory sequences controlling 
transcription of prosystemin gDNA into mRNA. The subject nucleic acids of the 
invention are capable of encoding prosystemins that are constitutively synthesized at 
a low level and wound-lnducible to a high level (see illustrative Example 7, below). 
Would-inducible and constitutive low-level expression is provided by regulatory 

20 elements within 3000 bp of the 5' region of the systemin gene sequence, the first 1 04 
nucleotides of which are shown in FIGURE BA. . Promoter, enhancer, and other 
regulatory elements within the 3000 bp 5' region are useful for insertion into 
recombinant plasmids and vectors for controlling gene expression in plants, (/.e,, 
genes other than prosystemin). Representative examples of genes that may be linked 

25 to the 5' regulatory elements of prosystemin include; genes encoding storage or 
nutritionally important proteins, such as vegetative storage proteins, seed storage 
proteins, tuber storage proteins and the like; and, genes encoding other plant defense 
genes. Le., other proteinase Inhibitors Bt toxen, and the like; genes encoding 
regulatory enzymes for metabolic and defensive processes, including phenylalanine 

30 amines, HMG CIA reductase and the like; genes encoding commercially important 
enzymes in plant syspension culture, such as proteinases, lipases, and the like; and, 
genes that regulate flower color. 

Purification and physical properties of a representative systemin polypeptide 
are disclosed (Example 1 . below). Skilled artisans will recognize that the relatively 

35 high proportion of hydrophilic amino acids in the prosystemin polypeptide suggest a 
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variety of conventional approaches to purification that may be used to purify a natural, 
recombinant, or synthetic prosystemin polypeptide, (e.g., ion exchange 
chromatography, affinity chromatography, specific ion precipitation, and the like). 

The subject amino acid sequence of prosystemin disclosed herein provides 
5 amino acid sequences that may be used to construct synthetic peptides of 
prosystemin or systemin; or, alternatively they may be used to instruct sites at which 
cleavage of a prosystemin polypeptide will liberate a systemin (e.g., enzymatic 
cleavage sites in a natural prosystemin or a chimeric recombinant prosystemin protein. 
In the latter case a chimeric recombinant prosystemin polypeptide may be produced 

10 In an expression system, the chimeric protein purified, and then systemin liberated 
from the chimeric protein by enzymatic cleavage.). Cleaving a prosystemin 
polypeptide at boundary amino acids produces systemin, e.g., cleaving the 
prosystemin of FIGURE 7 at both Lsu^j^-Ala^-^^, (e.g.. cleaving with an Leu-Ala-specific 
endopeptidase; abbreviated, LA peptidase) and at Asp^^-Asn^Qj (e.g.. cleaving with 

1 5 an Asp-Asn-specific endopeptidase; abbreviated. DN peptidase). As an alternative to 
the LA peptidase, a prosystemin polypeptide may also be cleaved by suitable 
enzymes at other upstream sites such as Arg^^s-Glu^ye or GIUi^e-Asp^T^; followed by 
sequential cleavage of the product with an N-termlnal peptidase, i.e., until the LA 
residues are reached and cleaved. In a similar manner and as an alternative to a DN 

20 peptidase, a carboxypeptidase or carboxydipeptidase may be used to sequentially 
remote amino acids, from the carboxy-terminus until the DN residues are reached and 
cleaved. Those skilled in the art will recognize that a suitable LA-specific peptidase(s) 
may be isolated from plant tissues, e.g.. by using natural (or synthetic) polypeptide 
substrates having the prosystemin-systemin boundary amino acid sequences (e.g., 

25 L-A and D-N) and assaying for the production of systemin biological activity. In one 
such Illustrative example, a recombinant prosystemin chimeric protein may be 
synthesized by an expression system and used as a substrate in enzymatic assays 
to identify and Isolate the LA and/or DN peptidase(s). 

Those skilled in the art will recognize that the subject prosystemin amino acid 

30 sequence may be used for constructing proteinase Inhibitors specific for the LA and/or 
DN peptidases, and such inhibitors may be useful for inhibiting systemin production 
from prosystemin; thereby inhibiting systemin activation of defense protein production 
in plants. Skilled artisans will also recognize that LA and DN peptidase may be 
selected with enhanced ability to liberate systemin from prosystemin (e.g. LA and DN 

35 enzymes having increased turnover number, decreased Km. increased Vmax. or 
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decreased sensitivity to feedback inhibition, and the like). Strains of plants may either 
be selected, or constructed {i.e., as transgenic plants), having increased LA and/or DN 
peptidase activity. The subject plants may exhibit increased resistance to predators. 
The subject systemin polypeptides of the present invention may also be used 
5 for identifying and isolating systemin receptors from plant cells. Those skilled in the 
art will recognize that the subject polypeptides can be labeled (e.g., with a radioactive 
label) and conjugated to a photochemical crosslinking agent. The subject conjugated 
and radiolabeled polypeptides bind to the cellular systemin receptor and 
photochemical activation forms covalent bonds between the polypeptide and Its 

1 0 receptor. When the receptor-polypeptide complex is extracted from the cell it may be 
isolated and identified by virtue of its label, e,g., the molecular size may be 
conveniently determined by SDS-PAGE and autoradiography. The subject 
polypeptides of the invention may also be useful in ligand affinity chromatography for 
isolating systemin receptors. 

1 5 Embodiments of the invention provide processes for enhancing or inhibiting 

synthesis of a defense protein in a plant by introducing the subject nucleic acids of 
the invention into a plant cell. In one representative example enhanced defense 
protein production may be achieved by inserting prosystemin (or systemin) nucleic 
acid in a vector downstream from a promoter sequence capable of driving constitutive 

20 high-level expression in a plant cell. In the latter case, when the subject vector is 
introduced into a plant cell the cells containing one or more copies of the subject 
nucleic acid may exhibit increased synthesis of systemin. When grown into plants the 
transgenic plants may exhibit increased synthesis of defense proteins, and Increased 
resistance to herbivores, as discussed in more detail In Example 1 1 , below. 

25 In another embodiment, the invention provides processes for inhibiting 

synthesis of defense proteins in a plant by inserting prosystemin antisense nucleic 
acid in a vector downstream from a promoter sequence. When the latter construct is 
introduced into plant cells the cells containing one or more copies of the subject 
nucleic acid may exhibit decreased synthesis of defense proteins. A representative 

30 example of a prosystemin antisense vector, and process for inhibiting synthesis of 
defense proteins is provided In Example 10, below. 

Transgenic plants containing the subject antisense nucleic acids of the 
invention are useful for: a) identifying other mediators that may be present In the 
prosystemin molecule, (e.g., other mediators that may induce expression of defense 

35 proteins or differentiation); b) establishing the extent to which any specific insect 
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and/or pathogen is responsible for damage of a particular plant. In the latter case the 
transgenic plants of the invention are useful for assessing the importance of systemin 
defense mechanisms in production of a plant as a crop. 

In still other embodiments, the invention provides transgenic plants constructed 
5 by introducing a subject nucleic acid of the invention into a plant cell, and growing the 
cell into a callus and then into a plant; or, alternatively by breeding a transgenic plant 
from the subject process with a second plant to form an F1 or higher hybrid (/.e.. F2), 
The subject transgenic plants and progeny may be used to find those plants that 
contain extra copies of the subject nucleic acid of the invention, and Increased 

1 0 expression of prosystemin or systemin. A representative example of a process for 
producing such a transgenic plant, and breeding it to obtain F1 offspring is provided 
below in Example 10. 

Those skilled in the art will recognize the agricultural advantages inherent In 
plants constructed to have either increased or decreased expression of systemin 

1 5 polypeptide. For example, such plants may have increased resistance lo attack by 
predators, insects, pathogens, microorganisms, herbivores, mechanical damage and 
the like. Skilled artisans will also recognize that chemical agents may be developed 
that will mimic or induce systemin activity (e.g., in a manner similar to methyl 
jasmonate induction of systemin activity), and these chemical agents may be useful 

20 when sprayed on plants in maximizing crop resistance to herbivores, pathogens, and 
mechanical damage. Representative examples of plants in which the process may be 
useful include (but are not limited to) tomato, potato, tobacco, corn, wheat, rice, 
cotton, soybean, alfalfa, rape, poplar trees, pine and fir trees and the like. 

The subject nucleic acids of the invention are also useful as oligonucleotide 

25 probes (e.g., ^P-labeled synthetic oligonucleotides constructed to complement 
portions of prosystemin nucleotide sequence), and restriction fragment probes (e.g., 
end-labeled restriction fragments of prosystemin cDNA), in Northern and Southern 
blots for selecting and screening among plants to Und natural and mutant strains with 
increased prosystemin expression and/or genomic copy number. This screening 

30 procedure can be useful for identifying plant strains with increased resistance to attack 
by predators, herbivores, insects, bacteria, fungi, viruses, mechanical damage, and the 
like. 

The subject polypeptides of the invention are useful for inducing monoclonal 
and polyclonal antibodies that may be used in immunoassays to detect the presence 
35 or amount of a prosystemin or systemin polypeptide in plant tissues, extracts, and 
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fluids (e.g. see E. Harlow and D. Lane, Antibodies: A Laboratory Manual, Cold Springs 
Harbor Laboratory, Cold Springs. N.Y. 1988). The latter immunoassays may prove 
useful for identifying natural and mutant strains of plants with increased levels of 
prosystemin or systemin. Strains exhibiting increased levels of the subject 
5 polypeptides may have increased resistance to attack by herbivores, /.e., insects, 
bacteria, fungi, and viruses. 

Systemin Is a primary polypeptide signal mediating wound-inducible expression 
of defense genes in distal leaves and, therefore, is the first example of a peptide 
hormone found in plants. The expression of a recombinant antisense genetic 

10 expression construct {i.e., containing an antisense prosystemin cDNA) resulted in an 
almost complete suppression of systemic wound induced defense protein synthesis 
in plants. The latter finding provides evidence that systemin is an integral component 
of the systemic signal transduction system in plants that induces defense protein 
synthesis in response to attack by predators and the like. It is considered most likely 

15 that systemin is the first member to be identified in a systemin family of plant 
polypeptide hormones. It is considered highly likely that members of the systemin 
family may regulate developmental events in the meristems, flower tissues, and fruit 
of plants, e.g., tomatoes and potatoes. Other members of the systemin family may 
be identified by virtue of their amino acid or nucleotide sequence homology with 

20 prosystemin or systemin, or by their ability to hybridize with the subject prosystemin 
or systemin nucleic acids of the invention. (In this regard, the nucleotide sequences 
of the axons identified in Example 6, below, may prove useful as oligonucleotide 
probes for identifying other systemin family members.) In this case the ability of a 
DNA or RNA to hybridize with the nucleic acid of the invention under conditions of 

25 reduced stringency, (e.g., a suitable protocol involving hybridization in 6 X SSC, at 
42°C in aqueous solution followed by washing with 1 X SSC, at 55°C in aqueous 
solution) will be considered a preliminary indication that the DNA or RNA contains a 
systemin family member. The DNA or RNA may then be sequenced and the sequence 
compared with the sequence of systemin and prosystemin. (Experimental conditions 

30 for controlling stringency are also described in Maniatis. T., et al., Molecular Cloning; 
A Laboratory Manual, Cold Springs Harbor Laboratory, Cold Springs, NY, 1982, at 
pages 387-389; and, also in Sambrook, Fritsch, and Maniatis. Molecular Cloning; A 
Laboratory Manual, Second Edition, Volume 2, Cold Springs Harbor Laboratory, Cold 
Springs, NY, 1989, pages 8.46-8.47.) Systemin family members may be recognized 

35 by virtue of about 50% to about 1 00%, or more preferably about 70% to about 1 00%, 
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and most preferably about 80% to about 100% homology at the amino acid or 
nucleotide level, /.e., over a stretch of about 5 or more amino acids or about 15 or 
more nucleotides. 

The foregoing may be appreciated more fully by reference to the following 
5 representative examples of the subject compositions and methods provided by the 
invention. 

EXAMPLE 1 

Isolation and sequencing of the systemin polypeptide. 
Oligogalacturonides were initially considered to be primary candidates as 

10 systemic signals for the wound response because they elicit synthesis of antibiotic 
phytoalexins in plant cells near the sites of infections {10, 11). Oligogalacturonides 
are released by pectin-degrading that are not found In tomato leaves. In addition, 
when labeled a-1, 4-oligogalacturonides were applied to wound sites on tomato 
plants, they were not found to be mobile (E.A. -H. Baydoun and S. Fry, Planta 165, 

1 5 269 1 985). Thus, oligogalacturonides are probably not involved as systemic mediators 
of signal transduction in plants, at least with respect to induction of proteinase 
inhibitor genes In response to wounding. 

A search was initiated for systemic signals inducing proteinase Inhibitor I and 
n genes in tomato leaf extracts. This search led us to identify a polypeptide in tomato 

20 leaves that is free of carbohydrates and induces proteinase inhibitor activity when 
supplied to young tomato plants. The polypeptide was purified using high- 
performance liquid chromatography (HPLC. see Materials and Methods, below). 
Inducing activity of the polypeptide was assayed by cutting the petioles of young 
plants and introducing eluted fractions from column separations into the cut over a 

25 period of 30 min. The plants were subsequently transferred to small vials of water, 
incubated under constant light for 24 hours as described (C.A. Ryan. Plant Physiol. 
54, 328. 1974), and the amount of proteinase Inhibitor I and n in the leaf juice was 
quantified by radial immunodiffusion in agar gels that contained rabbit antiserum to 
Inhibitor I or Inhibitor n (C.A. Ryan, Anal, Biochem. 19, 434, 1967; R. Trautman. K. 

30 Cowan, G. Wagner, Immunochemistry 8. 901 . 1 971). Over 30,000 young tomato plants 
were assayed over a 2.5 year period. With the use of this protocol, slightly more than 
1 |ig of an active factor {i.e., systemin) was isolated from approximately 60 pounds of 
tomato leaves. 

The elution profile of the preliminary extract of tomato leaves (FIGURE 1 ) was 
35 complex. Several fractions exhibited proteinase inhibitor inducing activity but one 
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peak (FIGURE 1 ) was selected for further purification because It contained the highest 
activity and the best yield from the purification. 

After several additional purification steps (see Materials and Methods, below), 
a major peak that possessed high specific activity was eluted from a strong cation 
5 exchange (SCX) HPLC column (FIGURE 2). The properties of the eluted material 
resembled those of a polypeptide, that is, absorbance in the spectral region 
appropriate for peptide bonds, total loss of activity and recovery of free amino acids 
after acid hydrolysis, partial loss of activity in the presence of trypsin and other 
proteolytic enzymes, and a positive assay result with bicinchoninic acid (P.K. Smith 

10 et al., Ana/. Biochem. 150, 76, 1985). Total amino acid analysis of the bioactive peak 
eluted from the SCX-HPLC (step 5, Materials and Methods, below) was determined as 
described below. The amino acid sequence analysis of the active component 
(conducted as described below; see. Materials and Methods) identified its length and 
determined the sequence of FIGURE 3: NH3-AVQSKPPSKRDPPKMQTQTD-COO"). No 

15 significant similarrties were found to known protein sequences and the polypeptide 
was named "systemin" (Protein Identification Resource release 26; Pearson/Lipman 
PASTA program at the Molecular Biology Computer Research Resource, Harvard 
Medical School). The sequence is a palindrome: xxQxBPPxBBxPPBxQxx (x, any 
amino acid residue; B, Lys or Arg; Q, Gin; P, Pro). A synthesized polypeptide of 

20 identical sequence to the systemin sequence (prepared as described below; see, 
Materials and Methods) eluted from the C1 8 (step 2) column with the same retention 
time as the native polypeptide. 
Materials and Methods: 

Purification and isolation of the polypeptide inducer of defense proteins: 
25 Step 1: Approximately 2 kg of tomato leaves Lycopersicon esculentum (v. 
Castlemart) were harvested from 20-day-old plants, grown under cycles of 17 hours 
light at 28°C and 7 hours dark at 1 8°C. Leaves were homogenized in a Waring 
blender for 5 min with distilled water (total volume of 4 liters) and filtered through four 
layers of cheesecloth. The liquid was adjusted to pH 4.5 with HCI and centrlfuged at 
30 lOOOg for 10 min. The supernatant was adjusted to pH 6.1 with ION NaOH, 
centrrfuged at 10,000g for 10 min at 20^0, and decanted through Whatman #4 filter 
paper. The filtrate was chromatographed on DEAE cellulose, followed by reversed- 
phase CI 8 flash chromatography, Sephadex G25 gel filtration, and then CM Sephadex 
chromatography. 

35 The DEAE cellulose column (Whatman DE52. 5.9 cm by 15 cm) was 
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equilibrated in 1M ammonium bicarbonate and washed exinaustively with distilled 
water. The material eluting in the void volume was collected and stored overnight at 
4''C. TFA was added dropwise to the stored elute to a final concentration of 0.2% 
(v/v); the solution was then clarified by centrifugation at 20,000g for 5 min at room 
5 temperature. The supernatant was loaded onto a reversed-phase flash column (C1 8, 
40nm, 3 cm by 25 cm) previously equilibrated with aqueous 0.1 % TFA. The column 
was eluted with the use of compressed nitrogen at 8 psi. After the sample was 
loaded, the column was washed with 200 ml 0.1 %TFA; the retained material was then 
eluted with successive washes of 20, 40. and 60% methanol In 0.1% TFA. The 

10 methanol was removed with a rotary evaporator and the remaining liquid was frozen 
and lyophilized. Two kilograms of leaf material yielded about 1 g of crude material 
containing systemin. The procedure was repeated 15 times. Samples (approximately 
4 g) of crude material dissolved in 20 ml water and adjusted to pH 7.8 with 10M 
ammonium hydroxide were loaded onto a G25 Sephadex column (4 cm by 44 cm) 

1 5 that was equilibrated with 50 mM ammonium bicarbonate, pH 7.8. The material eluting 
at and just after the void volume was recovered and lyophilized. Four identical runs 
through the entire procedure produced 1 .25 g of partially purified systemin. The 1.25 
g was dissolved In 500 ml H^O, the pH was adjusted to 6 with 1M NaOH, and the 
sample was applied to a CM Sephadex column (2 cm by 17 cm) and washed with 

20 0.01 M potassium phosphate, pH 6. The activity was retained by the CM Sephadex. 
eluted with 250 mM ammonium bicarbonate, and lyophilized. The total yield of 
proteins in this step was 190 mg. 

Step 2: The active fraction (1 90 mg) recovered from step 1 was dissolved in 1 0 
ml 0.1% TFA, centrlfuged at 20.000g for 5 min, filtered, and chromatographed on a 

25 reversed-phase CI 8 column. Five microliters of each 2-ml eluted fraction was diluted 
to 380 jil with 1 54 mM sodium phosphate, pH 6.5, and assayed for proteinase Inhibitor 
I inducing activity (x in Figure 1). Four plants were assayed per fraction. The material 
was injected into a semi-preparative reversed-phase CI 8 column (Vydac, Hesperia, 
CA, Column 218 TP510, 10mm by 250mm, 5-|im beads, 300A pores). Solvent A 

30 consisted of 0.1% TFA In water. Solvent B consisted of 0.1% TFA in acetonitrile. 
Samples were injected in solvent A and, after 2 min, a 90 minute gradient to 30% 
solvent B was begun for elution. The flow rate was 2 ml/min and eluted peaks were 
monitored at 225 nm. Several peaks of biological activity were found (as described 
below). The major peak of activity (shown in black in Figure 1) resided in tubes 43 

35 to 46, which were pooled and lyophilized. Total protein content of the pooled factions 
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was estimated at 2.5 mg. 

Step 3: The total material recovered in step 2, above (2,5 mg). was subjected 

to strong cation exchange HPLC on a poly-SULFO-ETHYL Aspartamide (SCX) column 
(4.6 mm by 200 mm. 5 iim, The Nest Group. Southborough, MA) with the use of the 
5 following solvent systems: Solvent A. 5mM potassium phosphate, pH 3, in 25% 
acetronitrile; solvent B, 5mM potassium phosphate, 500 mM potassium chloride in 
25% acetonitrile, pH 3. The sample was dissolved in 2 ml of solvent A. filtered, and 
applied to the column. After a 5-minute wash with solvent A, a 60-min gradient to 50% 
B was applied. The flow rate was 1 ml/min. and the elution profile was monitored by 
10 absorbance at 210 nm. The active fractions, tubes 35 to 38. were pooled and 
reduced in volume to 1 ml by vacuum centrifugation. 

Step 4: The pooled fractions from step 3 were subjected to reverse-phase CI 8 
HPLC in lOmM potassium phosphate, pH 6. Chromatography was performed on a 
Beckman Ultrasphere Ion pair column (4.6 mm by 250 mm, CIS, 5 |im). Solvent A 

15 was 10 mM potassium phosphate, pH 6. and solvent B was 10 mM potassium 
phosphate, pH 6, containing 50% acetonitrile. The active fractions, tubes 39 to 42, 
were pooled and vacuum centrrfuged to a final volume of 1 ml. This fraction was 
applied to the same column as the previous run but under the solvent and gradient 
conditions of step 2. The sample was adjusted to pH 3 with TFA, filtered through a 

20 0.45-jim syringe filter and chomatographed at a flow rate of 1 ml/min. The peaks of 
protein were detected at 21 2 nm. The fractions containing activity, eluting at 53.5 to 
56.5 min, were pooled and vacuum centrifuged to a volume of 1 ml. 
Sfep 5: The active fraction from step 4 was subjected to SCX-HPLC with the 
same column and conditions as used in sfep 3, except that the gradient was 

25 shallower, i.e., the column was run at 0% B for 5 min at which time a gradient to 30% 
B In 120 min was started. Fractions (0.5 ml) were diluted as In FIGURE 1 and assayed 
for proteinase Inhibitor I inducing activity (x in Figure 2). The biologically active 
fractions of the system peak (shown in black in Figure 2) were collected and analyzed 
for amino acid content and sequence. The profile was detected by absorbance at 21 0 

30 nm. Fractions eluting at 76 to 78.5 min were pooled and vacuum centrifuged to 
reduce the volume to 1 ml. 

Step 6: The step 5 fraction was desalted on a CI 8 HPLC column under the 

conditions of step 2. A 60-minute gradient to 30% solvent B was employed. The 
fractions containing the activity peak eluted at 55.0 to 58.0 min and were pooled and 
35 concentrated by vacuum centrifugation to 0.5 ml. The sample contained 
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approximately 1 |ig of protein, as estimated by amino acid content after acid 
hydrolysis. The biological activity of the sample had the potential to induce maximal 
accumulation of proteinase inhibitors in 40,000 tomato plants, (/.e., approximately 
40,000-fold purified). This sample was used for amino acid analysis and sequence 
5 determination. 

Amino acid analysis: 

The bioactive peak eluted from the SCX-HPLC column {step 6) was dried in 6 
by 50 mm glass tubes and hydrolyzed in the HCI vapor. The hydrolysates were 
derivatized with phenylisothiocyanate and analyzed by reverse-phase chromatography 
10 on 30 cm by 0.39 cm columns (Picotag, Millipore) according to the manufacturer's 
suggestions. 

Amino acid sequence analysis: 

The amino acid sequence of the bioactive peak-eluted from the SCX-HPLC 
column {step 6) was determined by established methods (DJ. Strydom et al., 

1 5 Biochemistry 25, 945, 1 985; B.A. Bidllngmeyer, S.A. Cohen, T.L Tannin, J. Chromatogr. 
336, 93, 1 984). Briefly, sequencing was performed on a Beckman model 890 spinning 
cup instrument, equipped for microsequencing, as recommended by the manufacturer, 
except that 0.1% water was added to the anhydrous heptafluorobutyric acid (HFBA) 
and 0.1% ethanethiol was added to the 25% trifluoracetic acid (TFA). Identification of 

20 the phenylthiohydantoin amino acids was by reverse-phase chromatography on an 
octadecylsilane column (IBM, Inc.; 30 cm by 0.46 cm, 3-jim particle size) with the use 
of the gradient system described (D.J. Styrdom et al., supra). Abbreviations for the 
amino acid residues are: A, Ala; C, Cys; D. Asp; E. Glu; F. Phe; G. Gly; H, His; I, lie; 
K. Lys; L, Leu; M, Met; N, Asn; P, Pro; Q. Gin; Arg; S, Ser; T, Thr; V, Val; W, Trp; 

25 and Y, Tyr. 

Synthesis of a synthetic systemin polypeptide: 

A polypeptide corresponding to residues 2 through 18 (FIGURE 3) was 
synthesized with the use of 9-fluorenylmethyl chloroformate (F-moc) solid phase 
chemistry with an Applied Biosystems Inc. Model 431 A synthesizer on a p-methyl 

30 benzyhyrylamine resin according to the manufacturer's protocol. The F-moc derivative 
of [^^C] Ala (New England Nuclear) was synthesized (J. Stewart and J. Young, Solid 
Phase Peptide Synthesis (Pierce Chemical Co., Rockford, IL, ed. 2, 1984). pp.67-68) 
and added to the NH2- terminal residue of the polypeptide; the [^*C] Ala-polypeptide 
was then cleaved from the resin. The radioactive peptide was purified by CI 8 HPLC. 

35 The specific activity of the synthetic systemin polypeptide was 1 9.0 jiCi/jimol. 
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EXAMPLE 2 

Systemic translocation of radiolabeled systemin polypeptide. 
The synthetic systemin polypeptide (described above) was tested for bioactivlty ^ 
and was found to be as effective as the native systemin polypeptide (purified above 
5 to step 6) for inducing the synthesis and accumulation of both Inhibitor I and n ? 
proteins when supplied to the cut stems of young tomato plants (FIGURE 4). About 
40 fmol of the polypeptide per plant was required to produce half maximal 
accumulation of Inhibitors I and n. which represents about 1 0^ times more activity on 
a molar basis than the previously reported PlIF oligogalactunonide Inducers derived 
10 from plant cell walls. The coordinate induction of synthesis of Inhibitor I and Inhibitor 
n proteins in response to the systemin synthetic polypeptide (FIGURE 4) is similar to 
a normal plant wound response that is transcriptionally regulated. This suggests that 
the polypeptide is activating the same signal transduction pathway activated by 
wounding (T. Green and C.A. Ryan, 1972, supra] J.S. Graham et al., 1986, supra), by 
15 oligosaccharides (P. Bishop et al., 1981, supra; M. Walker-Simmons et al., 1983, 
supra)t and by methyl jasmonate (E.E. Farmer et al., 1990, supra). 

The systemin polypeptide, unlike the prior reported oligogalacturonides, is 
transported out of wounds to distal tissues. ^*C-labeled polypeptide was synthesized 
(as described above. Example 1) and placed on fresh wounds of tomato plants. 
20 Within 30 min the radioactivity had moved throughout the leaf, and within 1 to 2 hours 
radiolabeled systemin was identified by HPLC (FIGURE 5) in the phloem exudate 
(expressed from the plant according to the method described by R,W. King and J.A. 
ZeeVaart, Plant PhysioL 53, 96, 1974). The polypeptide was named "systemin" 
because of its mobility through phloem. 
25 As well as being inducible by wounding in leaves, the genus for proteinase 

Inhibitors I and II are developmentally regulated in the meristems, flower tissues, and 
fruit of tomato species, and in potato tubers. Thus, it is likely that these 
developmental events may be mediated by systemin or by similar polypeptides that 
are members of the systemin family. : 
30 FIGURE 4 shows the results of experiments in which systemin synthetic 

polypeptides inducted synthesis and accumulation of proteinase Inhibitor I (•) and n 
(O) in the excised leaves of young tomato plants. The leaves were incubated in 
solutions containing the synthetic systemin polypeptide and the proteinase inhibitors 
were assayed as described above in Example 1 . Each data point was obtained from 
35 assays of the leaves of 36 tomato plants. 
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EXAMPLE 3 

Molecular cloning prosystemin. 
A prosystemin cDNA was isolated by screening a primary cDNA library 
synthesized from tomato leaf mRNA as follows: 
5 Poly A + mRNA was purified from tomato leaves using oligo dT columns 

(Phannacia). cDNA was syntliesized using the Stratagene cDNA synthesis system 
and was cloned into lambda ZAP vector arms (Stratagene). Approximately 80,000 
primary library recombinants were screened, by duplicate plaque lifts, using a 
degenerate oligonucleotide probe, termed "SPI", based on the amino acid sequence 

10 of the carboxy terminus of systemin {I.e. PPKMQTN; amino acids 190 to 196, as 
numbered in FIGURE 6, excluding the last nucleotide residue of the Asp^gg codon). 
The hybridization conditions for screening were: 6X SSC; IX Denhardts solution; 100 
jig/ml, yeast tRNA; 0.5% sodium pyrophosphate and approximately 4x10^ cpm of ^P- 
end-labeled SP1 . Hybridization was at 37°C for 36 hr. The filters were washed in 

15 several changes of 5X SSC, 0.1% SDS at 48''C for one hour. Approximately 50 
positive clones were identified and rescreened using a second degenerate 
oligonucleotide, termed "SP2", corresponding to the amino terminus of systemin 
(AVQSKP; amino acids 179 to 184, as numbered In FIGURE 6, excluding the last of 
the Pro,84 codon). The hybridization and wash conditions were Identical to those 

20 used for SPI except that the wash temperature for SP2 was 40"^. Of the Initial 
positive clones only one hybridized to the SP2 probe. Fragments of the prosystemin 
cDNA, termed "pSYS 1", were subcloned into Bluescript® plasmlds, single-stranded 
DNA was rescued and sequenced on both strands by dideoxy sequencing using 
Sequenase (USB; Sanger et al. PNAS 73, 5463, 1977). Sequencing of the SP2 

25 positive clone established that it encoded the systemin polypeptide within the larger 
protein which was called "prosystemin". The prosystemin cDNA was not full-length, 
beginning at nucleotide 112 as numbered in FIGURE 6. 

The prosystemin cDNA consisted of 839bp with an open reading frame 
encoding 197 amino adds. The reading frame remained open to the 5'-end of the 

30 clone, and since Northern blot analysis indicated that the systemin mRNA was 1 Kb 
in size, we concluded that the cDNA was missing approximately lOObp at the 5'-end. 
The complete prosystemin mRNA sequence was subsequently determined by 
sequencing the prosystemin gene (as described below In Example 6) and mapping 
the transcriptional start site (FIGURE 7; Example 6). The experiments described in 

35 Example 6, below, established the length of the open reading frame as 600 base pairs 
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encoding a prosystemin protein of 200 amino acids. The identification of the initiating 
methionine codon was made on the basis of two criteria; the presence of multiple stop 
codons immediately 5' to the methionine codon and the presence of an adjacent 
sequence similar to the plant consensus sequence for translationa! initiation (H.A. 
5 Lutcke et al., 1987, EMBO Journal 6:43). 

EXAMPLE 4 
Structure and properties of prosystemin. 
Based on the cDNA sequence (FIGURE 7), systemin Is located close to the 
carboxy terminus of the 23 kDa prosystemin protein (FIGURE 7, amino acid residues 

10 179 to 196; corresponding to nucleotides 639 to 699). The amino acid composition 
of prosystemin is unusual in that it contains a high percentage of charged amino 
acids; aspartic acid (10%), glutamic acid (17%), lysine (15%). but very few 
hydrophobic amino acids. In consequence, prosystemin is a markedly hydrophilic 
molecule. Analysis of the prosystemin sequence failed to reveal a hydrophobic region 

15 at the amino terminus that resembles a leader peptide. The post-translational 
processing pathway and site of sub-cellular compartmentallzation of prosystemin 
remain to be determined. A search of the EMBL and GeneBank data base, with both 
the cDNA and deduced protein precursor sequences failed to reveal significant 
homology to any of the listed sequences. 

20 Although the 1 8-amino acid systemin sequence occurs only once within the 

precursor, close to the carboxy terminus, other sequence elements are repeated. 
There is a short (6 to 9 amino acids), imperfect repeat occurring five times within the 
prosystemin sequence (crosshatched underlining, FIGURE 7). This observation 
suggests that at least part of the prosystemin gene may have evolved by multiple 

25 gene duplication/elongation events, a conclusion which is supported by the structure 
of the gene. 

EXAMPLE 5 

Proteolytic processing sites in prosystemin. 
The putative processing sites bordering systemin are shown in FIGURE 7 {i.e. 
30 amino acid residues 178 and 197). The Leu (178) and Asn (197) processing sites do 
not conform to the consensus sequence for the endoproteolytic processing sites 
flanking bioactive peptides within animal prohormone precursors [e.g., see animal 
sites in R.B. Harris, 1989, Arch. Biochem. Biophys. 275(2) :31 5 (1989). The minimum 
animal consensus sequence consists of a pair of basic amino acids which immediately 
35 precede the site of cleavage. In addition, the dibasic pair is often preceded, at a 
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distance of two or three amino acids, by a single basic amino acid. The animal 
consensus sequence Is. however, found once within the prosystemin sequence, at 
residues 183-188 (LysProProSerLysArg. FIGURE 7), which is a part of the mature 
systemin polypeptide. It is conceivable that the half-life of systemin is regulated by 
5 further processing at this site, e.g., to yield an 8 amino acid carboxy-terminal peptide. 

In animal systems prohormones are often processed to yield multiple bioactive 
peptides (J. Douglass, O, CIvelli and E. Herbert, 1984. /Inn. Rev, Biochem. 53:665; LJ. 
Jung and R.H. Schefler. 1 991 , Science 251 :1 330) and members of the systemin family 
of plant polypeptide hormones may be subject to similar processing mechanisms. 

10 EXAMPLES 

Structure of the prosystemin gene and systemin gene family. 
A primary library of 700,000 recombinants was plated on the bacterial strain 
(P2) PLK-17 (Stratagene) and screened, by duplicate plaque lifts, with nick-translated 
prosystemin cDNA. Hybridization was carried out as described below. A single 

15 positive clone was identified and purified: The gene was located on a 18Kb genomic 
DNA fragment from which it was sub-cloned into Bluescript^^ piasmids. A series of 
overlapping, deletions spanning most of the gene were produced using the Mung 
Bean/Exonuclease m system (Stratagene), Each deletion product was cloned into 
a Bluescript® phagemid from which single-stranded DNA was rescued for use as the 

20 sequencing template. The gene was sequenced by the dideoxy method of Sanger 
(supra) using Sequenase (U.S.B.). The sequence was completed using custom-made 
oligonucleotide primers. The deduced sequence was then confirmed using custom- 
made oligonucleotide primers (as described in the Materials and Methods below). 
The sequence of the prosystemin gene is presented in FIGURES 8A-8C. The 

25 gene is composed of 4526 bp comprising a 1 04 bp 5'-untranstated region, a 4176 bp 
coding region and a 246 bp 3'-untranslated region. A striking feature of the sequence 
is that it is 76% A:T-rich. The structure of the prosystemin gene is depicted in FIGURE 
9A; Southern blot analysis is shown in FIGURE 9B. 

Within the prosystemin gene ten introns interrupt the coding region (FIGURE 

30 10). in consequence, the exons are small, ranging, in size from 34 bp (exon 1) to 90 
bp (exon 10). The first 10 exons are organized as five pairs while the sequence 
encoding systemin is located on the final, unpaired exon. To investigate the 
relationship between the exon pairs we aligned the sequences of the first exon of 
each pair (exons 1 , 3, 5, 7, 9) and, separately, we aligned the sequences of the 

35 second exon of each pair (exons 2, 4, 6, 8. 1 0), as shown in FIGURES 1 1 A AND 1 1 B. 
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The sequences within the first exon group are all homologous to each other as are 
the sequences within the second exon group. There is no significant sequence 
hoPDology between the two groups. These observations suggest that the five exon 
pairs arose by successive duplications from a common ancestor. That the individual 
5 exons within a pair are not homologous to each other suggests that the ancestral unit 
from which the gene is derived was a structure corresponding to the exon pair rather 
than to the individual exons within a pair. 

In contrast to the set of five homologous exon pairs, the exon encoding 
systemin (exon 1 1 ) does not show significant sequence homology with any other part 

10 of the prosystemin gene. This observation suggests either that the exon encoding 
systemin may have arisen separately from the rest of the gene, to which it was 
subsequently added, or that the exon encoding systemin arose from the same 
ancestral sequence as the other exons but subsequently evolved more rapidly. 
Repeated amino acid sequences within prosystemin and tlie systemin gene family. 

15 The amino acid sequence of prosystemin, like that of the gene, is highly 

repetitive. A short oligopeptide sequence palendrome was identified above in 
systemin (Example 1 ) and a similar theme occurring five times within prosystemin that 
Is encoded by the first exon of each of the five homologue pairs. In addition, the 
presence of three different, tandem ly-repeated polypeptide elements within 

20 prosystemin provides a clue about the evolution of the prosystemin gene. 

The tandemly repeated elements occur once within the amino-terminal half of 
prosystemin and once within the carboxy-tenminal half of prosystemin. The 
polypeptide elements within the amino-terminal half of the precursor have been 
labelled Rep A, Rep B and Rep C and their repeats within the carboxy-terminal half of 

25 the molecule have been labelled Rep 2A. Rep 2B and Rep 2C. The sequences of 
these repeated polypeptide elements are shown aligned in FIGURE 12. The locations 
of the repeats within prosystemin are shown in FIGURE 1 3A, while the locations within 
the prosystemin gene of the DNA sequences encoding the polypeptide repeats are 
shown in FIGURE 138. As can be seen from this comparison the Rep a, Rep B and 

30 Rep C polypeptides are encoded by two exon pairs (namely, three plus four and five 
plus six) while Rep 2A, Rep 2B and Rep 2C are also encoded by two other exon pairs 
(namely, seven plus eight and nine plus ten). 

The observations thus suggest that a set of polypeptide sequences, encoded 
by two pairs of exons. has been duplicated as one step in the evolution of the 

35 prosystemin gene. It would then appear that the ancestral gene for prosystemin was 
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subject to a tandem duplication of a structure composed of at least two pairs of 
exons. This conclusion precludes a model in which the systemin gene evolved simply 
by successive duplications of a single exon pair. 
Intron boundaries may be shifted wiViin repeated DNA sequences, 
5 The amino acid sequences between the polypeptide Rep A, Rep B and Rep 

C regions, that was found in the amino terminal half of prosystemin, were not 
duplicated in the carboxy terminal half of the molecule where repeats Rep 2A, Rep 2B 
and Rep 2C are almost contiguous (FIGURE 13A). This observation is largely 
explained by the fact that exon 7 (encoding element Rep 2A). is truncated at its 3'-end 

10 (i.e., with respect to the sequence of exon 3. encoding element Rep A). Similarly, 
exon 9 (encoding element Rep 2C), is truncated at its 5 -end (I.e., compared to exon 
5, encoding element Rep C). 

In the case of exon 7. the truncation did not arise by deleting a short section 
of the gene, but by shifting the position of the intron boundary within the existing 

1 5 sequence. Comparing the sequence around the Intron junction at the 3*-end of exon 
7 with the corresponding junction sequence at the 3'-end of exon 3 (FIGURE 14), it 
is evident that the sequence corresponding to the S'-end of the exon 3 constitutes the 
5*-end of the intron between exons 7 and 8. This structure may have arisen by either 
the elongation of exon 3 or the shortening of exon 7. It Is not dear if the same 

20 process resulted In the truncation of the 5'-end of exon 9. 
/Materials and li^ethods: 

Primer extension was performed using a gel-purified oligonucleotide consisting 
of bases 82 to 1 11 of the antisense strand of the cDNA sequence shown in FIGURE 
7. Total RNA was extracted from young tomato plants 4 hr after wounding and poly 

25 A+ mRNA was isolated using oligo dT columns (Pharmacia). Three picomoles of 
oligonucleotide were end-labelled using y-ATP at a specific activity of 6000/Ci/mmol. 
2x10® cpm of labelled oligonucleotide were annealed to 4 jig polyA-f mRNA by 
heating to 85*^0 for 10 min then hybridizing overnight at 30°C in a solution of 40 mM 
PIPES, pH 6.4; 1 mM EDTA; 0,4 M NaCI; 80% formamide. The annealed nucleic acids 

30 were ethanol precipitated and resuspended in 30 \i\ of a solution containing: 50 mM 
Tris, pH 7.5; 75 mM KCI; 10 mM dithiothreitol; 3 mM MgClgi 500 \xM of each dNTP; 
100 iig/ml bovine serum albumin. Thirty units of M-MLV Reverse Transcriptase 
(Stratagene) and one unit of RNase Block II (Stratagene) were added and the reaction 
mixture was incubated for 90 min at 37°C. At the end of the reaction one ^1 of 0.5 M 

35 EDTA, pH 8.0 and one pi of DNAse-free RNAse A (10 mg/ml) were added to the 
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reaction mixture and incubated for a further 30 min at STC. The reaction mbrture was 
phenol extracted, ethanol precipitated and resuspended in four ^il TE buffer (10 mM 
Tric-HCL, pH 7.5, 0.1 mM EDTA, pH 8.0) to which six \x\ of formamide loading buffer 
(80%formamide; 10 mM EDTA, pH 8.0; 1 mg/ml xylene cyanol; 1 mg/ml bromophenol 
5 blue) were subsequently added. Two pi of the resuspended products were analyzed 
on a 6% acrylamide/8M urea sequencing gel. The size standards were sequencing 
products generated using the primer extension oligonucleotide as primer and single- 
stranded DNA derived from the 5'-end of the prosystemin gene as template. 
Sequencing was carried out using Sequenase (USB) following the manufacturer's 

10 instructions for generating sequencing products close to the primer. 

Mung Bean Nuclease analysis was carried out using a 400bp Scal-Ndel 
fragment spanning the 5'-end of the prosystemin gene. The Ndel site is located within 
the first exon of the systemin gene. The Ndel end of the fragment was end-labelled 
to a specific activity of 6 x 10® cpm/jxg and approximately 10® cpm were mixed with 

15 4 |xg of the same poly A+ RNA stock used in the primer extension experiment. The 
mixture was desiccated and resuspended in 1 5 jil of hybridization buffer. The mixture 
was covered with mineral oil, heated to 82°C for 6 min then hybridized overnight at 
37*C. The sample was then mixed with 200 pi of ice-cold Mung Bean Nuclease buffer 
(30 mM sodium acetate (pH 5.0), 50 mM sodium chloride, 1 mM zinc chloride, 5% 

20 (v/v) glycerol) to which 10 units of Mung Bean Nuclease (Stratagene) were added. 
The mixture was incubated for 30 min at 12*Cthen extracted with an equal volume of 
a 1 :1 mixture of phenolrchloroform. The digestion products were copreciprtated with 
1 (ig of yeast tRNA and resuspended in 4 |il TE buffer plus 6 \il formamide loading 
buffer. Three |il of the resuspended digestion products were analyzed on a 6% 

25 acrylamide/8M urea gel. Size markers were generated by using single-stranded DNA 
corresponding to the 5'-end of the gene as template. The sequencing primer was a 
19-mer corresponding to the first 19 bases (antisense strand) at the 3'-end of the 
Scal-Ndel probe fragment. 

EXAMPLE 7 

30 Wound'inducible expression of the prosystemin gene. 

In considering the role of systemin as a mobile signal that activates proteinase 
inhibitor genes in response to wounding, we investigated the possibility that the 
prosystemin gene, Itself, might be wound-inducible. Northern blot analysis was used 
to examine the levels of prosystemin mRNA and Inhibitor I mRNA in leaves of 

35 unwounded and wounded tomato plants (FIGURE 15A). Thirty-two young tomato 
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plants were wounded three weeks after germination. The plants had an upper and 
a lower leaf and a small apial leaf. The lower leaf was wounded and mRNA was 
isolated from the upper (unwounded) leaf at the following time points after wounding: 
0.5, 1.5, 3, 6, 9, 12, and 24 hours. Four plants were used for each time point. Total 
5 RNA (5 Jig) from each time point was electrophoresed on a 1.4% agarose- 
formaldehyde gel and blotted onto nitrocellulose. The blot was probed simultaneously 
with nick-translated prosystemin (SYS) and Inhibitor! (lnh-1) cDNAs (see Materials and 
Methods, below). Prosystemin mRNA was found to accumulate in both wounded and 
unwounded leaves of wounded tomato plants, demonstrating that prosystemin mRNA, 

1 0 like Inhibitor I mRNA, is systemically wound-inducible. Prosystemin mRNA reached 
the highest levels at three to four hours after wounding while Inhibitor I mRNA was 
most abundant eight to ten hours after wounding. Unlike the proteinase Inhibitor I 
message, which is absent In the leaves of unwounded tomato plants, a low level of 
prosystemin mRNA was detected in the leaves of unwounded plants. Low, 

1 5 constitutive expression of the prosystemin gene in leaves may provide a continuous 
supply of systemin, allowing the plant to immediately respond to wounding. 

The wound-induced accumulation of prosystemin mRNA and, presumably, 
prosystemin and systemin In the unwounded tissue may amplify the ability of the plant 
to react to subsequent damage. Continued damage by insect attacks would, 
20 therefore, liberate more systemin from the newly synthesized precursor than did the 
initial wounds, resulting In higher levels of proteinase inhibitor synthesis as the attacks 
persist. 

Since the initial rate of accumulation of prosystemin mRNA was faster than that 
of Inhibitor I mRNA In response to wounding (FIGURE ISA), some aspects of the 
25 signal transduction pathways activating the two genes may differ. Additional signals 
may be responsible for the different rates of accumulation or the signal transduction 
pathways may respond to the same signals but wth different sensitivities. 
Materials and Methods: 

Nick-translation was performed using the NEN DuPont nick-translation system 
30 according to the manufacturer's instructions. Hybridization was can-led out under the 
following conditions: 50% formamide; 5x Denhardts; 5X SSPE; 0.1% SDS; 100 ^g/ml 
sheared salmon sperm DNA; 1 pg/ml poly A and nick-translated DNA probe of specific 
activity approximately 10^ cpm/jig. Unless othenwise stated, blots were washed in 1 
XSSC, 0.1%SDSat65°C. 

35 
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EXAMPLE 8 

Distribution of prosystemin mRNA througtiout the plant 
Prosystemin mRNA is found throughout the aerial parts of the plant but not in 
the roots (FIGURE 15B), Total RNA was extracted from the following parts of an 
5 unwounded. fully-grown tomato plant: root (R); stem (St); petiole (R), leaf (Le), sepal 
(Se), petal (Pe). stamen (Sm) and pistil (Pi)(FIGURE 15B). Total RNA (5^g) from each 
sample was electrophoresed and blotted as described in Example 7. The blot was 
probed with nick-translated prosystemin cDNA (as described in Example 7, above). 
The highest constitutive levels of prosystemin mRNA are seen in the flower 
1 0 parts, a feature which is also characteristic of the distribution of Inhibitor I and Inhibitor 
n mRNAs. The general distribution of prosystemin mRNA (at least In the parts of the 
plant above ground), is consistent with the proposed role of systemin as a wound 
signal, since wounding of any aerial part of the plant would be expected to result in 
the systemic induction of proteinase inhibitor synthesis. The apparent absence of 
1 5 prosystemin mRNA in the roots is surprising since we have observed the induction of 
proteinase inhibitor synthesis in tomato leaves in response to wounding of the roots. 
It is possible that roots either contain very low levels of prosystemin mRNA, 
undetectable in the assay, or they employ a different wound signal(s) (e.g., a different 
systemin gene family member) to activate proteinase inhibitor genes in the leaves. It 
20 is also conceivable that prosystemin is transported from leaves to roots where 
systemin is released in response to wounding. 

EXAMPLES 

Species distribution of prosystemin gene tiomologues. 
To determine if prosystemin gene homologues are found in other plant 
25 species, Southern and Northern blot anafysis was performed on genomic DNA and 
total RNA from three species known to possess wound-inducible proteinase inhibitors: 
potato, Solanum tuberosum, var. Russett Burbank (C.A. Ryan, 1968, Plant PfiysioL 43, 
1 880), tobacco, Nicotians tabacum, var. Xanthi (G. Pearce, results in preparation), and 
alfalfa. Medicago sativa, var. Vemema (W.E, Brown and C.A. Ryan, 1 984, Bioctiemistry 
30 23:341 8; W.E. Brown, K. Takio, K. Trtani, C.A. Ryan, 1 985, Biochemistry 24:2^ 05); and. 
as a control, from one species {Arabidopsis thaliana, var. Columbia), which is not 
known to possess wound-inducible proteinase inhibitors. 

Southern blot analysis of the species distribution of prosystemin gene 
homologues is shown in FIGURE 9C. Genomic DNA (5^g) from tomato (FIGURE 9C. 
35 lane 1 ), potato (FIGURE 9G, lane 2), tobacco (FIGURE 90, lane 3), alfalfa (FIGURE 9C. 
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lane 4), and Arabidopsis (FIGURE 9C. lane 5) was digested with EcoR1; restriction 
fragments were separated by electrophoresis on a 0.8% agarose gel; and, fragments 
of prosystemin were visualized by blotting to nitrocellulose and probing with nick- 
translated prosystemin cDNA. The blot was washed at 55**C under moderately 
5 stringent conditions. Of the four plant species analyzed, a homologue of the 
prosystemin gene was identified under moderately stringent conditions only in potato 
(the nearest relative of the four species to tomato). A potato mRNA species was also 
identified that hybridized to tomato prosystemin cDNA and which comigrated with the 
tomato prosystemin mRNA. The nucleotide sequence of the gene in tobacco, alfalfa 

1 0 and Arabidopsis may have diverged from that of the tomato gene to the extent that 
St can no longer be detected by hybridization under stringent conditions with the 
tomato prosystemin cDNA. This interpretation is favored by the findings that a 
homologue could not be detected at greatly reduced hybridization and wash 
stringencies. Analysis of extracts from other plant genera should provide further 

15 insights into the distribution and evolution of members of the systemin gene family. 

EXAMPLE 10 
Antisense suppresion of the prosystemin gene. 
To determine if the prosystemin gene product has an important role in the 
systemic signal transduction leading to the expression of proteinase inhibitor genes 

20 In tomato leaves, a prosystemin antisense DNA was constructed and was used to 
transform tomato plants. The chimeric antisense DNA was composed of prosystemin 
cDNA, in the antisense 3' to 5' orientation, under the control of the constitutive CaMV 
35S promoter and inserted into the binary vector pGA643. 
Materials and Mettiods: 

25 Strand-specific, radiolabeled RNA probes were produced from the prosystemin 

cDNA using T3 and T7 RNA polymerases (Stratagene) according to the manufacturer's 
instructions. 

The antisense DNA construct was transformed Into Agrobacterium strain 
LBA4404 and the recombinant bacteria were used to transform tomato var. Better Boy. 

30 As controls for the primary transformants, tomato plants were transformed with the 
binary vector alone. Eighteen antisense plants and twenty one controls were 
regenerated. Three weeks after the transformed plants had been transferred to soil 
the lower leaves on each plant were extensively wounded and the levels of wound- 
inducible proteinase Inhibitors I and U were determined in the expressed juice of 

35 upper leaves twenty four hours later (C.A. Ryan, 1967, AjtaL Biochem. 19:434; R. 
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Trautman, K.M. Cown, G.G. Wagner. 1971, Immunochemistry 8:901). None of the 
plants were producing either Inhibitor I or inhibitor n in their leaves prior to wounding. 
Of the 18 plants containing the antisense gene, 11 plants produced Inhibitor I at less 
than 40% of the mean control level of 126.7 +/- 18.6 pg/ml leaf juice and Inhibitor n 
5 at less than 30% of the mean control level of 1 64.7 +/- 1 8.6 jig/ml leaf juice. 

FIGURE 16A shows Northern blot analysis of total RNA isolated from one of 
the antisense plants, designated 1 A4, using both sense and antisense-specific single- 
stranded RNA probes. Two samples of total RNA (Sjig) were electrophoresed and 
blotted as described above. The samples were probed separately with radiolabeled 

10 RNA probes specific for sense (FIGURE 16A, lane 1) and antisense (FIGURE 16A. lane 
2) prosystemin nriRNA (see Examples 6-8, above). 

In the wounding experiment the distal leaves of plant 1A4 expressed Inhibitor 
I at 42 |ig/ml leaf juice and Inhibitor n at 41 jig/ml leaf juice in response to wounding. 
The antisense RNA appeared as a band at approximately 1 .7 kilobases (FIGURE 16A, 

15 lane 2) compared to the prosystemin mRNA at 1 Kb (FIGURE 16A, lane 1). southern 
blot analysis showed that plant 1A4 contained a single copy of the antisense 
construct. This conclusion was confirmed by self-fertilizing plant 1A4 and analyzing 
28F1 progeny by Southern blot analysis. Seven (one quarter) of the 28 F1 progeny 
did not inherit the antisense construct. This experiment also demonstrated that the 

20 antisense construct was stably inherited in the F1 generation. 

To demonstrate that the antisense phenotype segregated with the antisense 
construct, the levels of Inhibitors I and II in the distal leaves of the 28 F1 plants were 
measured before wounding, and 24 hr after wounding. FIGURE 16B graphically 
depicts wound-induced accumulation of proteinase Inhibitor I and FIGURE 16C 

25 depicts wound-induced accumulation of proteinase Inhibitor 11 In the distal leaves of 
F1 antisense plants (unshaded bars) and untransformed controls (solid bars). 
Antisense plant 1 A4 was self-fertilized and the amounts of wound-induced proteinase 
Inhibitors I and II in the distal leaves of three-week old F1 progeny were measured 
by radial immunodiffusion assay (as described below). The plants had an upper and 

30 a lower leaf and a small apical leaf. The lower leaf was wounded and 24 hours later 
juice was expressed from the upper, unwounded leaf and assayed. The amount of 
Inhibitor I was measured in 28 F1 plants while the level of Inhibitor n was measured 
in 27 of the 28 F1 plants. A control group of thirty untransformed tomato plants, var. 
Better Boy, was also wounded and the amounts of Inhibitors I and n were measured. 

35 Inhibitor proteins were not detected in juice expressed from the leaves of six 



BNSCOCID: <WO _9319079A1_I_> 



wo 93/19079 



PCr/US93/02428 



- 31 - 

unwounded antisense plants and six unwounded control plants. Three quarters of the 
antisense plants (/.e., those inheriting the antisense construct), responded weakly to 
wounding compared to the control population of untransformed plants (FIGURES 16B 
AND 1 6C). Plants not inheriting the construct produced levels of proteinase inhibitors 
5 equal to those of the untransformed control plants. 

In six of the 28 F1 antisense plants Inhibitor I synthesis in the distal leaf was 
less than 15% of the mean control level of 97.2 +/- 4.7 |ig/ml while Inhibitor n 
synthesis was undetectable in the distal leaf (mean control level of 122.3 +/- 7.2 
jig/ml). Southern blot analysis of the six least responsive F1 plants suggests that 
1 0 these plants inherited two copies of the antisense construct, although this conclusion 
must be confirmed by self-fertilizing the plants and demonstrating that none of the F2 
progeny produce proteinase inhibitors in response to wounding at levels equal to 
those of the control plants. 

These experiments show that expression of antisense prosystemin mRNA in 
1 5 tomato inhibits the systemic induction of proteinase inhibitor syntheses in response 
to wounding, ft is Inferred that antisense prosystemin mRNA prevents the efficient 
production of prosystemin and, hence, of the mobile systemic wound-signal systemin. 

Materials and Methods: 

A 747 bp fragment of the prosystemin cDNA was excised from pSYS 1 as a 

20 BamHI-Hind m fragment. The BamHI site is located in the bluescript polylinker close 
to the 5'-end of the cDNA, while the Hind III site is within the cDNA at nucleotide 859 
as numbered in FIGURE 7. The antisense cDNA fragment thus contained all of the 
prosystemin mRNA sequence except for the first seven bp of the coding region, all of 
the 5'-untranslated region and the last 92 bp of the 3'-untranslated region. The cDNA 

25 fragment was placed under the control of the constitutive CaMV 35S promoter by 
cloning it (in the antisense 3* to 5' orientation), into the polylinker of the binary vector 
pGA643 digested with Bgl II and Hind III. The antisense construct was transformed 
into Agrobacterium strain LBA 4404 and the recombinant bacteria were used to 
transform tomato var. Better Boy. 

30 Tomato seeds, var. Better Boy, were sterilized by soaking for 15 min in a 1 5% 

(v/v) solution of Chlorox containing two or three drops of Tween 20. The seeds were 
washed four times with distilled water then geminated on medium containing: MS 
salts (4.3 g/L), agarose (6g/L) and thiamine (1 mg/L), pH 5.8. The geminating plants 
were grown at 28**C with 16 hr days. Eighty percent of the seeds germinated. After 

35 7-1 0 days, when the first true leaves appeared, the cotyledons were removed from the 
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seedlings and cut into cubes of edge length 0.2-0.5 cm. The tissue cubes were 
preconditioned on tobacco feeder plates for two days at 25^*0 in the dark. Tobacco 
feeder plates were prepared by subculturing tobacco (hJT-l) suspension cells in 
medium containing: MS sales (4.3 g/L), sucrose (30 g/L). inositol (0.1 g/L)» thiamine 
5 (1 mg/L), 2. 4-D (0.2 mg/L) and KH^PO^ (0.1 8 g/L) at pH 5.8. The cells were Incubated 
for four days at 25*^0 in the dark. The cells were plated over the same medium 
including 0.7% agarose, then incubated under the same conditions as before for a 
further two days. Pieces of tomato cotyledon were placed on Whatman No. 4 filter 
paper soaked in tobacco feeder plate medium and overlaid onto the tobacco feeder 

10 plates. The pieces of preconditioned tissue were punctured with a 20-gauge needle 
and infected with Agrobacterium by soaking them for thirty min in 15 mL of 
germination medium containing 10® cells/mL The tissue was blotted dry with sterile 
filter paper and incubated on tobacco feeder plates for a further two days at 25**C in 
the dark. The tissue pieces were then washed three times in germination medium, the 

15 third wash containing 0.5 g/L of Cefotaxime. The tissue pieces were blotted dry with 
sterile filter paper and placed on shooting medium containing: MS sales (4.3 g/L), 
thiamine (10 mg/L), nicotinic acid (1 mg/L). pyridoxine (1 mg/L), inositol (100 mg/L), 
sucrose (30 g/L), BAP (2.5 mg/L). I/\A (1 mg/L), cefotaxime (250 mg/L). carbenicillin 
(500 mg/L). kanamycin (100 mg/L) and 0.7% (w/v) agarose. The explants were 

20 transferred after the first three days of culture and weekly thereafter. Once callus 
growth was observed (after the third subculture) the explants were transferred to 
shooting medium from which the lAA and BAP had been removed and zeatin (2 mg/L) 
added. Once the shoots were 2-3 inches tall they were transferred to rooting medium 
which differed from shooting medium in the BAP, cefotaxime and carbenicillin were 

25 absent, vancomycin (0.5 g/L) was added and the concentrations of sucrose (20 g/L). 
kanamycin (20 mg/L) and \AA (0.05 mg/L) were reduced. 

EXAMPLE 11 

Effect of growth on t^anduca sexta larvae feeding on transgenic piants. 
Northem blot analysis of total RNA extracts, collected at different times from 
30 undamaged leaves of control (nontransformed) and transgenic tomato plants was 
performed during feeding experiments with l^anduca sexta larvae. Plants were 
approximately 18 inches In height, having 2-3 main stems. Total RNA from each 
sample was separated by electrophoresis, blotted onto nitrocellulose paper and 
probed with a nick-translated ^^P-prosystemin cDNA. The results are shown In Figure 
35 17. 
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The accumulation of Inhibitor I and II proteins in undamaged leaves of control 
and trasgenic antisense tomato plants, induced by feeding on Manduca sexta larvae 
was measured. Leaf juice was expressed with a mortar and pestle and assayed by 
radial immunodiffusion assay (Ryan. 1967). As shown in Figure 18, transgenic 
5 antisense tomato plants showed decreased accumulation of both Inhibitor I and n 
proteins. In Figure 18, A = transgenic antisense plants, C = wild type plants, PI = 
Inhibitor I and Pn = Inhibitor n. 

The growth of Manduca sexta larvae, while feeding on leaves of control and 
transgenic antisense tomato plants was measured. Ten first instar larvae were placed 

1 0 randomly on each of six control (wild tape) and transgenic plants and removed and 
weighed. Those larvae feeding on control plants were weighed at 14 days and those 
feeding on transgenic antisense plants were weighed at 10 and 14 days. As shown 
in Figure 1 9, those larvae feeding on transegenic antisense plants showed increased 
growth weight over those feeding on wild type plants. Values in Figure 19 are the 

1 5 average from three different plant experiments, statistical significance is shown. 

Transgenic plants containing a prosystemin gene driven by a CaMV promoter 
(Example 10, above) were produced. The prosystemin gene was essentially the same 
as the antisense gene (McGurl ef a/. , Science 255, 1 570-1 573), except the prosystemin 
cDNA was in the sense orientation. Manduca sexta lan^ae were allowed to feed on 

20 leaves of control (wild type) and transgenic sense tomato plants. The control leaves 
contained little or no Inhibitor I or II. The leaves of the transgenic plant expressing 
prosystemin contained about 300 |i/g tissue Inhibitor I and about 200 ji/g tissue 
Inhibitor U. As shown in Figure 20, the weight of larvae feeding on the control leaves 
increased at a faster rate than the lan^ae feeding on the transgenic sense leaves. 

25 Those skilled in the art can now appreciate from the foregoing description that 

the broad teachings of the present Invention can be implemented in a variety of forms. 
Therefore, while this invention has been described in connection with particular 
examples thereof, the true scope of the invention should not be so limited since other 
modifications will become apparent to the skilled practitioner upon a study of the 

30 specification and following claims. 

All publications and applications cited herein are incorporated by reference. 
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SEQUENCE LISTING 

(l)CENERAX INFORMATION: 
5 (i) APPLICANT:Ryan,C.A. ?McGurl,B.F. ;Pearce,C.L- 

(ii) TITLE OF INVENTION : -SYSTEMIN* 

(iii) NUMBER OF SEQUENCES: 4 

( iv ) CORRESPONDENCE ADDRESS : 

(A) ADDRESSEE: Chri»t«n««n, O'Connor, Johnson and 
10 Kindn«BS 

(B) STREET:2800 Pacific First Center, 1420 Fifth Av«nu« 

(C) CXTy:S«attl* 

( D ) STATE : Washington 

(E) COUNTRY: USA 

15 (F)2XP: 98101-2347 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TyPE:Di»kette-5.25 inch, a.2Mb atorage 

(B) COMPUTER: IBM PC/3B6 Compatible 
CC) OPERATING SYSTEM : MS-DOS 4,01 

20 (D) SOFTWARE: Word for Windowa-t 

(vi)CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: none 

(B) FILING DATE: 03/ 19/92 

( C) CLASSIFICATION : 

25 (vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: none 

(B) FILING DATE: none 
(viii) ATTORNEY /AGENT INFORMATION: 

(A)NAM£: Sundsmo, John, S. 
30 (B) REGISTRATION NUMBER: 34 , 446 

( C ) REFERENCE /DOCKET NUMBER: WSUR-1-6 17 1 
(ixj TELECOMMUNICATION INFORMATION 

(A) TELEPHONE: 1-206-682-8100; 1-206-224-0727 (direct) 

(B) TELEFAX: 1-206-224-0779 
35 (C)TELEX: 4938023 

(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 200 aroino acids 
40 (B) TYPE; amino acid 

( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: polypeptide 

( A) DESCRIPTION : proaystemin #l-#200 ; syatcmin #179-196 ; 

Figure 6-7 . 

45 (ix) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

Met Gly Thr Pro Ser Tyr Asp He Lys Asn Lys Gly Asp Asp Met Gin 
15 10 15 

Glu Glu Pro Lys Val Lys Leu His His Glu Lys Gly Gly Asp Glu Lys 
50 20 25 30 

Glu Lys He lie Glu Lys Glu Thr Pro Ser Gin Asp He Asn Asn Lys 
35 40 45 
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Xep Thr lie S«r Ser Tyr Val Leu hrg Asp hmp Thr Gin Clu II* Pro 

50 55 60 

Ly» Ket Clu His Clu Clu Cly Cly Tyr Val Lye Clu Lys He Val Clu 
65 70 75 80 

Ly« Clu Thr He Ser Cln Tyr He He Lye He Clu Cly Aep Aep Aep 
85 90 95 

Ale Cln Clu Lye Leu Lys Val Clu Tyr Clu Clu Clu Clu Tyr Clu Lye 
100 105 110 

Clu Lye He Val Clu Lye Clu Thr Pro Ser Cln Aep He Aen Aen Lye 

115 120 125 

Cly Asp Asp Ala Cln Clu Lye Pro Lye Val Clu Hie Clu Clu Cly Aep 

130 135 140 

Asp Lys Clu Thr Pro Ser Gin Asp He He Lys Met Clu Cly Clu Cly 
145 150 155 160 

Ala Leu Glu He Thr Lye Val Val Cys Clu Lys He He Val Arg Clu 
165 170 175 

Asp Leu Ala Val Cln Ser Lys Pro Pro Ser Lye Arg Aep Pro Pro Lys 
180 185 190 

Met Gin Thr Asp Aen Asn Lys Leu 
195 200 

(2 )INrORHATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) L£NCTH:951 bases 

(B) TyPE:nucleic acid 

( C ) STRANOEDKESS : single 
{ D ) TOPOLCXJy : linear 

( ii ) MOLECULE TYPE : CDNA 

(A)DESCRIPTION:prosy8temin cDNA; start ATC at #105 
(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
AAAATTAAAT TTCATATTTC CTTTAACTCG ATTTTCCATG AACACCCTTA CTCATCACTA 60 

TATAAACCTC ACCTCATCAA CACTTCAAAT AAACTAACAA AACCATCCCA ACTCCTTCAT 120 

ATGATATCAA AAACAAAGGA GATCACATGC AAGAAGAACC AAACGTGAAA CTTCACCATG 180 

AGAAGGGAGG AGATGAAAAC CAAAAAATAA TTGAAAAAGA CACTCCATCC CAAGATATCA 240 

ACAACAAAGA TACCATCTCT TCATATCTTT TAAGACATGA TACACAAGAA ATACCAAAGA 300 

TCGAACATGA CGAGGGAGGA TATGTAAACG AGAAAATTGT TGAAAACGAG ACTATATCCC 360 

AATATATCAT CAACATTGAA GGAGATGATG ATCCACAAGA AAAACTAAAG GTTGACTATC 420 

AGGAGGAAGA ATATGAAAAA CAGAAAATAG TTGAAAAAGA CACTCCATCC CAAGATATCA 480 
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ACAACAAAGG AGATCATGCA CAAGAAAAAC CAAAGCTGCA ACATCAGGAA GGAGATCACA 540 
AAGAGACTCC ATCACAAGAT ATCATCAAGA TCCAACCCCA GGCTCCACTA CAAATAACAA 600 

5 

AGCXGGTATC TCACAAAATT ATAGTACCAG AAGATCTTCC TCTTCAATCA AAACCTCCAT 660 
CAAACCCTGA TCCTCCCAAA ATGCAAACAG ACAATAATAA ACTCTAGAAA CATCCAAAAA 720 
10 AAATTAATAA ATAAAAAATT ATAATTCAGA ACGATAAAGT AAAAATTCTG AATTTGTCTC 780 

CCCTTAGAAA AGTXACTTCA AATAAATATT TGTCTTTCTT TCTATTTTCA AAGTCTAATT 840 
TCCTTATTCT ACTTTGAGAA GCTTTCTTTA GATTGTTATG TACTTCTATT GCTTCCTTTC 900 
TTTTCGCTTA TTTATATAAT ATAAATAAAA AATAAATAAA TATCTAAAGA T 951 



15 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i)SEQUENCE CHARACTERISTICS: 
20 (A)LENCTH:18 amino acids 

(B) TYPE: amino acid 
(D ) TOPOLOGY : linear 
{ i i ) MOLE CULE TY PE : pept ide 

(A ) DESCRIPTION :«y«temin; Figure 3. 
25 (ix) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Ala Val Gin Ser Lya Pro Pro Ser Lya Arg Asp Pro Pro Lys Met Gin Thr Asp 
15 10 IS 

(2) INFORMATION FOR SEQ ID NO: 4: 
30 (i)SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4526 bases 

(B) TYPE:nucleic acid 
(D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE:DNA 
35 (A) DESCRIPTION :systemin genomic DNA; Figures 8A-8C. 

(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
AAAATTAAAT TTGATATTTC GTTTAACTCG ATTTTCCATG AACACCCTTA GTGATGAGTA 60 



40 



TATAAAGCTC AGCTCATGAA GAGTTGAAAT AAACTAAGAA AACCATGGGA ACTCCTTCAT 120 
ATGATATCAA AAACAAAGGT ATCATTTCTT TATATGCCTA AGTATATATT TATTTATATA 180 
TTTTGTAACT AAAATTTTAT ATTAAAATCA ACAAGTGAGA GTTTAACAAA AATCATATTA 240 
45 AAGAAAAAAA ATATATTAAT TTTTAATCAT GGTATTATCC TCCAGGAGAT GACATGCAAG 300 

AACAACCAAA GGTGAAACTT CACCATGAGA AGGTAACTTT AGTTTCTCCT TTTTCTTTTT 360 
TCAACTTCTT TATATATTAT TTTTGTAAAT TTTTTTATAT TATAATGTTC AAATGGTCTC 420 
ATTTTCTAAT TAATAATGTG TCTGAATCGC CATGTTATTT ATGTTAGATT TAATACATTA 480 
ATAACATTGT TAGTAAATGT TAGAATACTC ACTCCCAAAT TCCCTTAAGC AACAACTATA 540 



50 
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TTTCATGTGT TTCTTTCCAG ATAACAATAA 
CATAATATTT TXTCCTAGAA AACTCCAACT 
TTTAATTATA CTTTTTTAAA CATGATAAAT 
TTTTTTTACA AAAATTCTAT TTCCTCTCAA 
AAAATATCTC ATCTCATTTG ATTATATACA 
CGACATTTTA CCATTCATTA ACACTAATCT 
TCCTTAACTT TTCTCTATAA TTCACATTTG 
ATCAAATGTT CAAATAAAAT TCAATGACAA 
CTTCTTCATT CAACTTATTT ACCGTTATAA 
TTTTTACTCA CCTGAAATGA TTTAATCAAC 
AGAATAATGT TCTT T ATACT TCTATACAAT 
TATGTTTAAT TAAATACTAT TAAAATGAAC 
TTTTTTTTCC ATCAAAATTT ACACCGACGA 
ACTCCATCCC AACATATCAA CAACAAACAT 
TTTTTTTCAA XCTTTTTTTT TTC7CATCTT 
TTATGTTTTT TTTTAACTTT AATTATAATA 
ACCAAAGATG GAACATGAGG AGG7AACTAT 
AATGACTTAT TCATTGATTC AATTTATTTT 
TATTATCACC CCACTCCCTC CACTACTCAT 
ATTTTCTTTA TTCATAAGTC ATTTATTTTT 
ATACATTTTG TTTATACATA ATTCATCTAT 
ATTCCATACA TTTTGTTCAT ACATAAGTCA 
TTAACGTTTG CATACAATTT TGTTCATACA 
TCAGTTAAAC TTTTATATAC AATTTTGCTC 
TATGTTCAGT TAAACCTAAT AAATAAAATT 
GA7ATTACAG GGAGGATATG TAAAGGAGAA 
TATCATCAAG ATTGAAGGTA TAATCTATTT 
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TTATGTTTTG TAAAGCAAAT AAAATAATAA 600 
CATTATTTAG ATATTTAGAT TATGATTTGC 660 
TATTTCTCTT ACACATTTTC GATTCATTTT 720 
ACGTTTACTA CTTAGTTAAG TTAACTATAC 780 
TCAGGCTCAA TTAAAACATA TTGCAGA7AT 840 
CTATAGTTAG AAAATCTCAA ATATTTCAAA 900 
AAACTATATG TTTAATTATA ACAAACCGTA 960 
TAGGTATAAG GAGCTATCAA TATATTAGCT 1020 
TTAAATAATG ACTCCTTAAT TCATTTAATT 1080 
TCATTTATCA CCCTTATTTA CGACTCATCT 1140 
TTACTCGCAT ATTTTTTTTA AATTTTTTTT 1200 
AAATATTATT TATAATTGAA GAATATTGAA 1260 
GATGAAAAGG AAAAAATAAT TGAAAAAGAG 1320 
ACCATCTCTT CATATGTTTT AACTATTTAA 1380 
CTTATTTTAA TCATCTAAAA GAAATTATTA 1440 
TTATCCAGCA GCAGATGATA CACAAGAAAT 1500 
ATATTTCAAT TTATTTAC7A ATTTATAAAT 1560 
AATTCCTTTG AAATCAAACT AAGGTTACCA 1620 
TTAAAATGAT GGTTTGATAC TTTGCATGCA 1680 
CAAAAATTTT ATGTTCAGTT AAACGTTTGC 1740 
TTCTTTTAAA ATTTTATGTT CAGTTAAACG 1800 
TCTATTTTTT TAAAAAAAAA TTATGTTCAC 1860 
TAATTCATCT ATTTTTTTAA AATTTTATGA 1920 
GTACATAAGT CATCTATTTT TTTAAAATTT 1980 
AGACTCTGGA AATATTATTT ATTATTAAAG 2040 
AATTCTTGAA AACGAGACTA 7ATCCCAATA 2100 
A7ATCTCTC7 AAA7A7TTAA 7TTTATTTTT 2160 
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A TTTTTCAGA TTTTTTACTA ACCCATTTTT TTATTTTTTT TCAAAAAATC TCAATCATTT 2220 
TCAAGAACTT AATATTATTT TTCCTAACTT TAATCTTCAT ATATTATTCT CCACCACATC 2280 
ATCATCCACA AGAAAAACTA AAGCTTGACT ATGACCACGT AACTTTAATT TCTTCTTTCA 2340 
CTTTTTATTT ATTATTTTTC TATATTTTAC TCTCTATTTA TTTCATATTC ACAAATTATA 2400 
TTTATCACAT ATATATTCCT TTATTTTCTT CAAAATTACA CCAAGAATAT GAAAAACAGA 2460 
AAATAGTTGA AAAAGAGACT CCATCCCAAC ATATCAACAA CAAACCTATA TATCATATCT 2520 
TCATATGCCT AACATTTTAT T T T C TT C TTA TTTTTCATAT TATTTTTATT TAACTAAATT 2580 
15 TAGTATGAAA CCTTTTTTTT TTTAAAAAAA TCATCTTAAA TAAAATATTA TTTTTCCCTC 2640 

ACTCAAATCA TTGACCTTAT ATATTCTCCA GCAGATCATG CACAAGAAAA ACCAAACCTG 2700 
CAACATCAGC TAACTACTTA TATTTTTCTC TCTCTTTATT ACATAAAATC ACATTAGTTA 2760 
TATCATAATT CGCTATGCTA ATAATAAAAA AACAATTAAT ATATTTATAC CAATTTAAAC 2820 
AGCCTCGAGT GTCCATCATC TTTATTTTTA TCTTCTAAAG TTACTAACAC TATTTCCAAA 2880 
25 TAGACCTTTA CTTTGAGCAA AATCTATCAG AAAATACGAT AATAAAGAAG TCACCCTGAA 2940 

AATAAAATAT TAATTTTGTG ACGTGAAACC AATATCAAGA GCCCCCTCAA TTTCTTCTAT 3000 
TATGTCAGAT CCAACATCCT TCTTTCTTCT CCTGAAGTAT AGCAGCCCTT AGCACACATC 3060 
TCAACATAAT CCGCGATAAT AACGTTTTAA TGGTGAATCT ATCCGTATCA TAACAATAGT 3120 
ATACAACTTT AAACCTAATG ATCGTCTAGC TAGTAATCTT TCAAAATGAC GGACCCTAAT 3180 
35 TACTCACAAA ATTTGTGTCT AACATAACTT ATGTACCATA ACAATAATAT ATCTTGTGTA 3240 

ATTTATGAGT GAACCTAGGG TTTGAAATTA AACATAATCA ATAAAATTGG ACAAAGAAGA 3300 
TATTATTTAT TAATTGAAAG ATATTAATAG TTTTTTTCTT CAAAATTACA GGAAGCAGAT 3360 
GACAAAGAGA CTCCATCACA ACATATCATC AACATGGAAG GTATCAATCT ATTTATATTT 3420 
TTTATAAGTA T ' lI IT l I C TT ACAATTTTTT TATTTCCTTT GCTATATATG AAACTATTTT 3480 
45 TTTTAACCAT CTTTAAAAAA AAATAATACT TATGTATAAC TATAATCATG ATATTATCAT 3540 

CCAGGCGAGC CTGCACTAGA AATAACAAAG GTCGTATGTG AGGTAACTAA ATTTCTTCTT 3600 
CCAATTTTTC TATACATTAT GTTTGTATTT TTTTTTTTGG ATTCATTCGA ACTTTCTTCG 3660 
ATAGAAAGTC TTGCTATCTA TATACGATTA AAATTATATT GACTTTACGA TAAAAATATA 3720 
TTTAAACAAT TCTTTTTTTA ATTTCATATC TAAACTATTC AAAATGTGTC TGCCCTCGTA 3780 
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XCCTCCCTAC AAAGCCXACT AGAACCACAT 
GTGAGAAATA CGCTGAAACT ATCCCTTATT 
TATTGAATCC TCTTCTATTT ATTCCTATCT 
AAAATTCTGA CTTCACCACT CTATATATCT 
TGTTTACTTA TAATAACTAA TAACCCTCTT 
TTTTGGACAC TACAACGAGC ATACAATTCA 
TATTTTTTAT TATAAAAGGA TATXAATTTC 
ACGAGAAGAT C7TGCTGTTC AA7CAAAACC 
AACACACAAT AATAAACTCT AGAAACA7CC 
TCAGAACGAT AAACTAAAAA TTCTGAAT77 
ATATTTCTCT TTCTTTGTAT TTTCAAAGTG 
CTTTAGATTG TTATCTACTT CTATTCCTTC 
7AAAAAA7AA ATAAATA7CT AAAGAT 



41 - 

T7TAAATGAT TAAAAAAA7C 7TTTGAAACT 3840 
ATTTTATTTT TACCTATATC CAATAGACAA 3900 
TTACTTCCTC ACATATCAAA TCTCTTACTA 3960 
TTTTATTTTG ATTTTTCATT OCATTTCATT 4020 
TTATTTTATT TATAGCATGA TGCTACTATT 4080 
AATCTCAAAC TTTTTTATAT TTTTTTTCTA 4140 
■ rXTTTTCl Il CAAATACACA AAATTATAG7 4200 
TCCATCAAAC CGTGATCCTC CCAAAATCCA 4260 
AAAAAAAATT AA7AAATAAA AAAT7ATAAT 4320 
G7C7CCCG77 AGAAAAGTAA CTTCAAATAA 4380 
TAATTTGCTT ATTCTACTTT CAGAAGCTTT 4940 
CTTTCTTTTG GCTTATTTAT ATAATATAAA 4500 

4526 
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WHAT IS CLAIMED IS: 

1. A nucleic acid comprising a nucleotide sequence substantially as 
shown in SEQ. ID. NO. 2. 

5 

2. A nucleic acid comprising a nucleotide sequence substantially 
complementary to the sequence of Claim 1 . 

3. The nucleic acid of Claim 1 . further comprising the nucleotide sequence 
10 of residues number 639 through 692 of SEQ. ID. NO. 2. 

4. The nucleic acid of Claim 1. further comprising a synthetic 
oligonucleotide. 

"•S 5. A recombinant nucleic acid vector comprising the nucleic acid 

sequence of Claim 1 operably linked to a promoter nucleotide sequence. 

6. A polypeptide encoded by the nucleic acid of Claim 1 . 

20 7. The nucleic acid of Claim 1 capable of encoding an antisense RNA. 

8. A recombinant nucleic acid vector comprising the nucleic acid 
sequence of Claim 2 operably linked to a promoter nucleotide sequence. 

2S 9- A nucleic acid comprising a nucleotide sequence substantially as 

shown in SEQ. ID. NO. 4. 

10. A nucleic acid comprising a nucleotide sequence substantially 
complementary to the sequence of Claim 9. 

30 

11. The nucleic acid of Claim 9, further comprising a synthetic 
oligonucleotide. 

12. A polypeptide having an amino acid sequence substantially as shown 
35 in SEQ. ID. NO. 1. 
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13. A polypeptide having an amino acid sequence substantially as shown 
in SEQ. ID. NO. 3. 

14. A polypeptide comprising the amino acid sequence 
5 RiRiQRiRgPPRiRaRaRiPPRgRiQRiR,, wherein R, is any amino acid, Rg is lysine or 

arginine, Q is glutamine, and P is proline. 

15. The polypeptide of Claim 14, wherein the polypeptide comprises the 
amino acid sequence NH3-AVQSKPPSKRDPPKMQTD-COO-. 

0 

16. A process for inhibiting synthesis of a defense protein in a plant 
comprising introducing the nucleic acid sequence of Claim 2 into a plant cell and 
growing the cell into a plant. 



15 17. A transgenic plant produced by the process of Claim 1 6, or by breeding 

the plant of Claim 16 with a second plant, 

18. A transgenic plant comprising a recombinant nucleic acid that encodes 
systemin or prosystemin. 

20 

19. The transgenic plant of Claim 18, further comprising a plant selected 
from the group consisting essentially of tomato, potato, tobacco, alfalfa, rape, 
soybean, corn, rice, wheat, barley, cotton, poplar tree, pine and fir tree. 



25 20. A method of enhancing the defense mechanism of a plant comprising 

the step of introducing to the plant a nucleic acid coding for a polypeptide hormone 
which stimulates the production of a defense protein in the plant. 



30 



21 . The method of Claim 20, wherein the polypeptide hormone is systemin. 

22. Substantially pure systemin transcript. 



23. The DNA template sequence for the transcript of Claim 22. 



BNSDOCID: <WO 9319079A1J_> 



wo 93/19079 PCr/US93/02428 

-44- 

24. The DNA sequence substantially complementary to the template 
sequence of Claim 22. 

25. The amino acid sequence translated from the transcript of Claim 22. 

5 

26. The transcript of Claim 22. wherein the transcript is not of in vivo origin. 

27. A method of increasing the level of polypeptide hormone in a plant 
comprising the step of introducing a recombinant nucleic acid coding for the 

1 0 polypeptide hormone. 

28. The method of Claim 27. wherein the polypeptide hormone is systemin. 
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Met Gly Thr Pro Ser Tyr Asp lie Lys Asn Lys Gly Asp Asp Met Gin 
15 10 15 

Glu Glu Pro Lys Val Lys Leu His His Glu Lys Gly Gly Asp Glu Lys 
20 25 30 

Glu Lys lie lie Glu Lys Glu Thr Pro Ser Gin Asp He Asn Asn Lys 
35 40 45 

Asp Thr He Ser Ser Tyr Val Leu Arg Asp Asp Thr Gin Glu He Pro 
50 55 60 

Lys Met Glu His Glu Glu Gly Gly Tyr Val Lys Glu Lys He Val Glu 
S5 70 75 80 

Lys Glu Thr He Ser Gin Tyr He He Lys He Glu Gly Asp Asp Asp 
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Glu Lys He Val Glu Lys Glu Thr Pro Ser Gin Asp He Asn Asn Lys 
115 120 125 

Gly Asp Asp Ala Gin Glu Lys Pro Lys Val Glu His Glu Glu Glv Aso 
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Ala Leu Glu He Thr Lys Val Val Cys Glu Lys He He Val Arg Glu 
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Asp Leu Ala Val Gin Ser Lys Pro Pro Ser Lys Arg Asp Pro Pro Lys 
180 185 190 

Met Gin Thr Asp Asn Asn Lys Leu 
195 200 



FIGURE 6 
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